1 UF-221C1XCZ1 
DESCRIPTION 

MATERIALS AND METHODS FOR DELIVERY AND EXPRESSION OF 
HETEROLOGOUS DNA IN VERTEBRATE CELLS 

5 

The subject invention was made with government support under a research project 
supported by U.S. Department of Agriculture Grant No. 97-35302-443 1 and National Institute of 
Health Grant No. P50-HL59412-01 . The government has certain rights in this invention. 

10 Cross-Reference to a Related Application 

This application is a continuation of U.S. application Serial No. 09/662,254, filed 
September 14, 2000, which claims the benefit of U.S. provisional patent application Serial No. 
60/224,479, filed August 10, 2000; and which is a continuation-in-part of U.S. application Serial 
No. 09/086,651, filed May 29, 1998, now U.S. Patent No. 6,127,172. 

15 

Background of the Invention 
Gene therapy is a powerful concept just now beginning to see applications designed to 
treat human diseases such as genetic disorders and cancer. The introduction of genes into an 
organism can be achieved in a variety of ways, including virus-based vectors. Viral gene therapy 
20 vectors can either be designed to deliver and express genes permanently (stable integration of a 
foreign gene into host chromosome) or transiently (for a finite period of time). 

Current virus-based gene transfer vectors are typically derived from animal viruses, such 
as retroviruses, herpesviruses, adenoviruses, or adeno-associated viruses. Generally, these 
viruses are engineered to remove one or more genes of the virus. These genes may be removed 
25 because they are involved in viral replication and/or to provide the capacity for insertion and 
packaging of foreign genes. Each of these known vectors has some unique advantages as well as 
disadvantages. One primary disadvantage is an inability to readily package and deliver large 
DNA inserts that are greater than 10 kb in size. 

To illustrate the problem of capacity of most gene therapy vectors, one need only 
30 consider adeno-associated virus (AAV), one of the most promising of the gene therapy vectors. 
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Adeno-associated virus (AAV) is a parvovirus which consists of a 4.7 kb single stranded DNA 
genome (Nienhuis, A.W., C.E. Walsh, J.M. Liu [1993] "Viruses as therapeutic gene transfer 
vectors" In: N.S. Young (ed.) Viruses and Bone Marrow, Marcel Decker, New York, pp. 353- 
414). The viral genome consists of the family of rep genes responsible for regulatory function 
5 and DNA replication and the cap genes that encode the capsid proteins. The AAV coding region 
is flanked by 145 nucleotide inverted terminal repeat (ITR) sequences which are the minimum 
cis-acting elements essential for replication and encapsidation of the genome. In the absence of a 
helper virus such as adenovirus, AAV causes a latent infection characterized by the integration of 
viral DNA into the cellular genome. The major advantages of recombinant AAV (rAAV) 

10 vectors include a lack of pathogenicity in humans (Berns, K.L and R.A. Bohenzky [1987] 
"Adeno-associated viruses: an update" Adv. Virus Res. 32:243-306), the ability of wild-type AAV 
to integrate stably into the long arm of chromosome 19 (Kotin, R.M., R.M. Linden, K.I. Berns 
[ 1 992] "Characterization of a preferred site on human chromosome 1 Oq for integration of adeno- 
associated virus DNA by nonhomologous recombination" EMBOJl 1 :5071-5078), the potential 

15 ability to infect nondividing cells (Kaplitt et al [1994] "Long term gene expression and 
phenotypic correction using adeno-associated virus vectors in the mammalian brain" Nat. Genet. 
8: 148-1 54), and broad range of infectivity. However, the packaging capacity of AAV limits the 
size of the inserted heterologous DNA to about 4.7 kb. Gene therapy vector systems are also 
needed that combine a large carrying capacity with high transduction efficiency in vivo. 

20 Until recently, complex insect viruses (entomo viruses) had not been considered for use as 

possible viral gene therapy vectors. In the past, studies of entomoviruses have mainly 
concentrated on their use as biopesticides, expression systems or taxonomic novelties to compare 
to their mammalian virus counterparts. 

The family Poxviridae comprises two subfamilies, the Chordopoxviridae (vertebrate) and 

25 the Entomopoxviridae (insect) viruses (EPVs). EPVs were first discovered in the early 1960's, 
and have subsequently been shown to have a worldwide distribution. The subfamily contains 
three genera; A, B and C, which infect beetles, moths (lepidoptera) and grasshoppers, and midge 
flies respectively (Moyer, R.W. [1994] Entomopoxviruses, p. 392-397, Encyclopedia of 
Virology , R.G. Webster and A. Granoff (eds.), Academic Press Ltd, London). It should be 

30 recognized that classification within the three EPV genera is based solely on morphological and 
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host range criteria and not molecular properties. Indeed, it is now clear that the group B viruses 
of butterflies and moths (lepidoptera) and grasshoppers (orthoptera) are quite distinct from one 
another (Afonso, C.L., E.R. Tulman, Z. Lu, E. Oma, G.F. Kutish, and D.L. Rock [1999] "The 
genome of Melanoplus sanguinipes Entomopoxvirus" J. Virol. 73:533-552). AmEPV was 
5 originally isolated in India from the red hairy caterpillar, and it is the prototype virus of this 
group. This is primarily because of its ability to be easily grown in cultured insect cells, although 
certain Choristoneura and Heliothis EPVs have also been shown to replicate in cell cultures at 
low levels (Fernon, C.A., A.P. Vera, R. Crnov, J. Lai-Fook, R.J. Osborne, and D.J. Dall [1995] 
"Replication of Heliothis armigera entomopoxvirus in vitro" J. Invertebr. Pathol. 66:216-223; 

10 Lytvyn, V., Y. Fortin, M. Banville, B. Arif, and C. Richardson [1992] "Comparison of the 
thymidine kinase genes from three entomopoxviruses" J. Gen. Virol. 73:3235-3240). 

EPVs are the most distant relatives of mammalian poxviruses and exhibit both 
similarities and differences to the more commonly studied chordopoxviruses, such as vaccinia 
virus (VV). Similarities include morphology, a large linear double stranded genome (previously 

15 estimated at 225 kb for AmEPV, 190 kb for VV), common transcriptional regulation sequence 
motifs, non-spliced transcripts and a cytoplasmic site of replication. Differences include the 
G+C content of the viral DNA (a low 18% for AmEPV, 37% for VV), optimal growth 
temperatures (28°C for AmEPV, 37°C for VV), and host range. AmEPV does not replicate in 
vertebrate cells, and VV does not replicate in insect cells, although both viruses enter their 

20 respective non-permissive cells and initiate a replicative cycle (Langridge, W.H. [1983] 
"Detection of Amsacta moorei entomopoxvirus and vaccinia virus proteins in cell cultures 
restrictive for poxvirus multiplication" J. Invertebr. Pathol. 42:77-82). 

Generally, growth of AmEPV in insect cell cultures is similar to that of vertebrate 
poxviruses in mammalian cells. Receptors mediating poxvirus attachment and entry appear to be 

25 widespread and common, as EPVs infect vertebrate cells and W infects insect cells (Li, Y., R.L. 
Hall, and R.W. Moyer [1997] "Transient, nonlethal expression of genes in vertebrate cells by 
recombinant entomopoxviruses" J. Virol. 71:9557-9562; Li, Y., S. Yuan, and R.W. Moyer 
[1998] "The non-permissive infection of insect (gypsy moth) LD-652 cells by vaccinia virus" 
Virology 248:74-82). It is assumed by analogy with the vertebrate poxviruses that AmEPV gene 

30 expression patterns can be classified as early, intermediate and late, but experimental data is 
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minimal (Winter, J., R.L. Hall, and R.W. Moyer [1995] "The effect of inhibitors on the growth of 
the entomopoxvirus from Amsacta moorei in Lymantria dispar (gypsy moth) cells" Virology 
21 1:462-473). EPVs have been shown to contain vertebrate poxvirus promoter elements and 
early transcription termination motifs (Afonso, C.L., E.R. Tulman, Z. Lu, E. Oma, G.F. Kutish, 
5 and D.L. Rock [1999] "The genome of Melanoplus sanguinipes Entomopoxvirus" J. Virol 
73:533-552; Hall, R.L. and R.W. Moyer [1991] "Identification, cloning, and sequencing of a 
fragment of Amsacta moorei entomopoxvirus DNA containing the spheroidin gene and three 
vaccinia virus-related open reading frames" J. Virol 65:6516-6527). The most unique feature of 
poxvirus replication is development mostly, if not exclusively, within the cytoplasm. As a 
1 0 consequence of cytoplasmic development, EPV promoters (like those of vertebrate poxviruses) 
are recognized only by the virally encoded transcription system. The general availability of 
poxvirus specific promoters, coupled with exclusion of the nuclear transcription apparatus are 
major advantages for engineering and control of foreign gene expression related to gene therapy 
applications. 

15 EPVs, like W, contain a number of genes which are nonessential for growth in cell 

culture. Two examples are the thymidine kinase (TK) and spheroidin genes. The spheroidin 
gene can be viewed as a counterpart to the polyhedrin and A-type (ATI) occlusion genes of 
baculoviruses and cowpox viruses respectively. VV also contains an ATI gene, but it is 
defective. Spheroidin is the most abundantly expressed AmEPV gene, and serves to "occlude" 

20 infectious virions within an environmentally resistant occlusion body. Both the AmEPV TK and 
spheroidin gene can readily serve as sites for insertion and expression of foreign genes by 
utilizing standard plasmid-mediated recombination. 

Entomopoxvirus (EPVs) productively infect and kill only insects (Granados, R.R. [1981] 
"Entomopoxvirus infections in insects," in Pathogenesis of Invertebrate Microbial Disease, p. 

25 102-126, Davidson, E.W. (ed.) New Jersey, Allanheld Totowa) and can be isolated from 
Amsacta moorei (AmEPV), the red hairy caterpillar. Entomopox viruses and vectors have been 
described (See, for example, U.S. Patent Nos. 5,721,352 and 5,753,258, the disclosure of which 
is incorporated herein by reference). Like other EPVs, AmEPV cannot productively infect 
vertebrate cells. Indeed, following addition of AmEPV to vertebrate (mouse L-929) cells at 

30 multiplicities up to 10 particles/cell, no changes in cellular morphology (as judged by phase 
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contrast microscopy) are detected (Langridge, W.H. [1983] "Detection of Amsacta moorei 
entomopoxvirus and vaccinia virus proteins in cell cultures restrictive for poxvirus 
multiplication" 1 Invertebr. Pathol 42:77-82). 

AmEPV infects vertebrate cells in a non-cytocidal manner and the infection is abortive. 
5 Like all poxviruses, the virus is cytoplasmic and does not normally enter the nucleus. A 
consequence of this unusual biology, is that all poxvirus mediated gene expression takes place in 
the cytoplasm in the infected cell. AmEPV promoters and those of the eucaryotic cell are 
completely different and cellular promoters are not recognized by the AmEPV transcription 
machinery nor are AmEPV viral promoters recognized by RNA polymerase II of the host cell. 

10 

Brief Summary of the Invention 
The subject invention concerns a novel viral vector system for gene therapy based on an 
insect poxvirus designed to deliver genes for integration and stable, permanent expression in 
vertebrate cells. In an exemplified embodiment, a recombinant AmEPV vector was constructed 

1 5 that contains heterologous genes under the control of promoters that the drive the expression of 
the heterologous genes in vertebrate cells. The gfp gene and the gene encoding G41 8 resistance 
were used in an exemplified construct. The recombinant AmEPV was used to infect vertebrate 
cells and following infection the cells were transferred to media containing G418. Cells 
expressing both GFP and G418 resistance were obtained. Thus, the vectors of the subject 

20 invention can be used to deliver large DNA segments for the engineering of vertebrate cells. 

The subject invention also concerns cells that have been infected with or transformed 
with a recombinant vector of the present invention. The subject invention also concerns methods 
for providing gene therapy for conditions or disorders of an animal requiring therapy, such as 
genetic deficiency disorders. 

25 In addition, the subject invention concerns novel AmEVP polypeptides and the 

polynucleotide sequences which encode these polypeptides. The AmEPV polynucleotide 
sequences of the subject invention encode a triacylglyceride lipase (SEQ ID NO: 1), a Cu^/Zn** 
superoxide dismutase (SOD) (SEQ ID NO: 2), a CPD photolyase (SEQ ID NO: 3), a 
baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 4), two poly(A) polymerase small 

30 subunits (SEQ ID NOS: 5 and 6), two DNA polymerases (SEQ ID NOS: 7 and 8), an ABC 
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transporter-like protein (SEQ ID NO: 9), a Kunitz-motif protease inhibitor (KPI) (SEQ ID NO: 

10), and a poly(A) polymerase large subunit (SEQ ID NO: 1 1). 

In addition, the subject invention concerns isolated AmEPV polypeptides encoded by the 

polynucleotide sequences of the subject invention, including a triacylglyceride lipase (SEQ ID 
5 NO: 12), a Cu^/Zn^ superoxide dismutase (SOD) (SEQ ID NO: 13), a CPD photolyase (SEQ 

ID NO: 14), a baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 15), two poly(A) 

polymerase small subunits (SEQ ID NOS : 1 6 and 1 7), two DNA polymerases (SEQ ID NOS : 1 8 

and 19), an ABC transporter-like protein (SEQ ID NO: 20), a Kunitz-motif protease inhibitor 

(KPI) (SEQ ID NO: 21), a poly(A) polymerase large subunit (SEQ ID NO: 22) and other 
1 0 AmEPV polypeptides. 

The subject invention further pertains to other entomopoxvirus sequences. 

Polynucleotides of the subject invention include, for example, sequences identified in the 

attached sequence listing, as well as the tables and figures and described by open reading frame 

position within the genome. 
15 In addition, the subject invention includes polynucleotides which hybridize with other 

polynucleotides of the subject invention. 

Polynucleotide sequences of this invention have numerous applications in techniques 

known to those skilled in the art of molecular biology having the benefit of the instant disclosure. 

These techniques include their use as insertion sites for foreign genes of interest, hybridization 
20 probes, for chromosome and gene mapping, in PCR technologies, and in the production of sense 

or antisense nucleic acids. 

Brief Description of the Drawings 
The file of this patent contains at least one drawing executed in color. Copies of this 
25 patent with color drawings(s) will be provided by the Patent and Trademark Office upon request 
and payment of the necessary fee. 

Figure 1 shows a physical map of an exemplified recombinant vector of the subject 
invention (p AmEPV TKUF5) in which a portion of the plasmid pTKUF5 has been cloned within 
the AmEPV TK gene flanking regions. TR is the AAV terminal repeat; pA is a polyadenylation 
30 site; SD/SA is the SV40 late splice donor, splice acceptor sequence. GFP, the green fluorescent 
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protein gene, is under the control of a CMV promoter. Neo, the neomycin resistance gene, is 
under the control of a herpes TK gene promoter. 

Figure 2 shows an electrophoretic analysis of transformed mammalian cell lines. Each 
lane contains HindUl digested genomic DNA. Lane P contains genomic DNA from 293 cells 
5 and pTR-UF5 plasmid, as a positive control. Lanes Al through A5 contain DNA extracted from 
transformed cell lines made by recombinant AmEPV (AmEPVpTKUFS) infection. Lanes Bl 
through B6 contain DNA obtained from cell lines transfected with plasmid pTR-UF5. 

Figure 3 shows expression of lacZ in recombinant AmEPV-infected mammalian cells. 
CV-1 cells were mock infected (A) or infected with various AmEPV lacZ recombinants, where 
10 lacL was under the control of the cowpox virus late ATI gene promoter (B), the late AmEPV 
spheroidin promoter (C), the M melonontha early fus promoter (D) or the AmEPV early esp 
promoter (E). Infection of human Huh-7 liver cells with the AmEPV TKesp-lacZ recombinant is 
also shown as an additional control (F). The infected cell monolayers were stained with X-gal 
24h postinfection. 

15 Figure 4 shows the survival of mammalian cells following infection by recombinant 

AmEPV TKesp-gfp. Subconfluent CV-1 cells were infected with AmEPV TKesp-gfp at an m.o.i. 
of 1 PFU/cell. The individual fluorescent cells were located and followed over a period of two to 
three days and periodically photographed with a fluorescent microscope. One fluorescent cell, 
identified 18 hours post infection (A), had divided into two cells by 24 (B) to 26 (C) h 

20 postinfection and by 50 h had become a small cluster of dividing cells (D). 

Figure 5 shows AmEPV-mediated /3-galactosidase expression in the muscle of mouse. 
2X10 6 PFU (100 til) of recombinant AmEPV -esplacZ was injected into the muscle of the hind 
leg of a mouse. As a control, mice were injected with the same amount of recombinant AmEPV- 
SPHlacZ or 100 fxl of PBS. Two days later, the mice were sacrificed, the muscle was excised 

25 from the injected area and cut into small pieces, and fixed with 3% formaldehyde for 30 min, 
then stained with X-gal. The muscle injected with recombinant AmEPV pTK-esplacZ showed 
/3-galactosidase expression. No /3-galactosidase expression was observed in control mice. 

Figure 6 shows transformed 293 cells (A) derived from the colony infected with 
recombinant AmEPV-TKUF5 which are G418 resistant showing that cells are GFP positive, as 

30 well as non-fluorescent, non-transformed 293 cells (B). 
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Figure 7 shows a linear map of the AmEPV genome, 0-1 39440(A) and 139441- 
232392(B). Predicted ORFs are numbered consecutively from left to right based upon the 
position of the initiating methionine codon. ORFs transcribed in a rightward direction are shown 
above the horizontal line designating the viral genome; ORFs transcribed to the left are below. 
5 ITRs are indicated by heavy black arrows. A distance of lkb is as shown. ChPV homologs are 
indicated with red numbers, additional MsEPV homologs are indicated with purple numbers. 
Some ORFs have been assigned function based upon BLAST data. 

Figure 8 shows a comparison of the genomic organization of AmEPV, MsEPV and W. 
AmEPV ITRs are positioned at the termini of the viral genome as indicated. AmEPV genes 

1 0 which have homology to VV genes are depicted in (A). AmEPV genes which have homology to 
MsEPV are depicted in (B). Genes in the AmEPV genome common to both MsEPV and VV are 
in (C). Unique genes encoded by AmEPV are shown in (D). 

Figure 9 shows a comparison of the spatial distribution of homologous genes between 
AmEPV, MsEPV and VV. A random sampling of genes conserved within the genomes of all 

15 three indicated viruses were plotted on the 1 19kb genome of VV, the 232kb AmEPV genome, 
and the 236kb MsEPV genome. From left to right on the AmEPV genome, the genes shown and 
their BLAST-assigned function are: AMV016, thymidine kinase; AMV035, membrane protein; 
AMV038, PAP large subunit; AMV050, DNA polymerase; AMV051, RP035; AMV066, 
RP0132; AMV105, VETF-L; AMV122, rifampicin resistance; AMV138, no BLAST-assigned 

20 function; AMV150, ATP/GTP binding protein; AMV166, RP019; AMV181, core protein; 
AMV186, no BLAST-assigned function; AMV205, VLTF-3; AMV221, RP0147; AMV232, 
membrane protein; AMV243, membrane protein; AMV249, no BLAST-assigned function. Plots 
compare both orientations of the AmEPV genome. (A) left to right, (B) right to left. 

Figure 10 shows residues shared between poxvirus poly(a) polymerase subunit 

25 homologs. Consensus shows the conservation between all five sequences. Insect consensus 
shows identity among the four EPV ORFs. AmEPV consensus displays identities between the 
two AmEPV subunits. 

Figure 1 1 shows the transmembrane domains possessed by the AmEPV ABC transporter 
protein. This graphic was produced by the THAMM program (Sonnhammer, E. L. L., Hejne, G., 

30 and Krogh, A. [1998] "A hidden Markov model for predicting transmembrane helices inprotein 
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sequences" Proc. of Sixth Int. Conf. on Intelligent Systems for Molecular Biology (J. Glasgow, 
T. Littlejohn, F. Major, R. Lathrop, D. Sankoff, and C. Sensen, Eds.), pp. 175-182. AAAI press, 
Menlo Park, CA.). The regions of the protein indicated by the thirteen bars can be seen to have a 
probability of 1 as transmembrane domains. Although not shown in this figure, the areas 
5 between these domains (residues 432-601 and 1097-1285) contain ABC transporter (ATP 
binding) motifs (Prosite PS0021 1). 

Figure 12 shows the amino acid sequence of the AmEPV serine protease inhibitor. 
Amino acid abbreviations are standard. The Kunitz family signature (Prostite PS00280) is 
shown underlined and italicized from residues 55 to 73. 

10 

Brief Description of the Sequences 
SEQ ID NO: 1 is the nucleotide sequence of the gene encoding AmEPV triacylglyceride 

lipase. 

SEQ ID NO: 2 is the nucleotide sequence of the gene encoding AmEPV Cu^/Zn** 
15 superoxide dismutase (SOD). 

SEQ ID NO: 3 is the nucleotide sequence of the gene encoding AmEPV CPD 
photolyase. 

SEQ ID NO: 4 is the nucleotide sequence of the gene encoding AmEPV baculo virus-like 
inhibitor of apoptosis (IAP). 
20 SEQ ID NO: 5 is the nucleotide sequence of the gene encoding a first AmEPV poly(A) 

polymerase small subunit. 

SEQ ID NO: 6 is the nucleotide sequence of the gene encoding a second AmEPV 
poly(A) polymerase small subunit. 

SEQ ID NO: 7 is the nucleotide sequence of the gene encoding a first AmEPV DNA 
25 polymerase. 

SEQ ID NO: 8 is the nucleotide sequence of the gene encoding a second AmEPV DNA 
polymerase. 

SEQ ID NO: 9 is the nucleotide sequence of the gene encoding AmEPV ABC 
transporter-like protein. 
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SEQ ID NO: 10 is the nucleotide sequence of the gene encoding AmEPV Kunitz-motif 
inhibitor (KPI). 

SEQ ID NO: 11 is the nucleotide sequence of the gene encoding AmEPV poly(A) 
polymerase large subunit. 
5 SEQ ID NO: 12 is the amino acid sequence for the AmEPV triacylglyceride lipase. 

SEQ ID NO: 13 is the amino acid sequence for the AmEPV Cu ++ /Zn H " superoxide 
dismutase (SOD). 

SEQ ID NO: 14 is the amino acid sequence for the AmEPV CPD photolyase. 
SEQ ID NO: 15 is the amino acid sequence for the AmEPV baculovirus-like inhibitor of 
10 apoptosis (IAP). 

SEQ ID NO: 16 is the amino acid sequence for the first AmEPV poly(A) polymerase 
small subunit. 

SEQ ID NO: 17 is the amino acid sequence for the second AmEPV poly(A) polymerase 
small subunit. 

15 SEQ ID NO: 18 is the amino acid sequence for the first AmEPV DNA polymerase. 

SEQ ID NO: 19 is the amino acid sequence for the second AmEPV DNA polymerase. 
SEQ ID NO: 20 is the amino acid sequence for the AmEPV ABC transporter-like 

protein. 

SEQ ID NO: 21 is the amino acid sequence for the AmEPV Kunitz-motif inhibitor 

20 (KPI). 

SEQ ID NO: 22 is the amino acid sequence for the AmEPV poly(A) polymerase large 
subunit. 

SEQ ID NOS: 23-27 is the nucleotide sequence of the AmEPV genome. 

25 Detailed Disclosure of the Invention 

The subject invention concerns three aspects of entomopoxviruses (EPVs) as novel 
recombinant vectors: (1) As a system for the expression of high levels of foreign proteins, (2) for 
the transient expression of foreign genes in mammalian cells and (3) for the stable transformation 
of vertebrate cells for the long term expression of foreign proteins. In addition, the subject 

30 invention provides the nucleotide sequence of the entire genome of genus B entomopoxvirus 
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from Amsacta moorei (AmEPV). Accordingly, the subject invention also concerns isolated 
polynucleotides encoding AmEPV proteins. 

The subject invention concerns novel recombinant vectors and methods for delivery and 
expression of heterologous polynucleotides in vertebrate cells. The recombinant vectors of the 
5 subject invention provide for stable integration and expression of heterologous DNA in the host 
cell. Advantageously, the vectors of the invention are adapted for accepting large heterologous 
polynucleotide inserts which can be delivered in an infected or transformed cell and expressed in 
a stable fashion. The subject invention can be used to provide gene therapy for conditions or 
disorders of vertebrate animals, such as a mammal or human, that is in need of such therapy. 

10 One aspect of the subject invention concerns a recombinant EPV vector which can 

optionally include heterologous DNA which can be expressed in a cell infected or transformed 
with the subject vector. Preferably, the EPV vector is derived from AmEPV. The recombinant 
EPV vectors of the present invention can optionally include inverted terminal repeat (ITR) 
sequences of a virus, such as, for example, adeno-associated virus, that flank the heterologous 

15 DNA insertion site on the vector. Thus, when the heterologous DNA is cloned into the 
recombinant EPV vector, the heterologous DNA is flanked upstream and downstream by the ITR 
sequences. 

In an exemplified embodiment, the subject vectors comprise heterologous DNA inserted 
within the vector. The heterologous DNA contained within the recombinant vectors of the 

20 invention can include polynucleotide sequences which encode a biologically functional protein. 
Preferably, the polynucleotides encode proteins which can provide therapeutic replacement or 
supplement in animals afflicted with disorders which result in the animal expressing abnormal or 
deficient levels of the protein that are required for normal biological function. Proteins encoded 
by the heterologous DNA can include, but are not limited to interleukins, cytokines, growth 

25 factors, interferons, enzymes, and structural proteins. Proteins encoded by the heterologous 
DNA can also include proteins that provide a selectable marker for expression, such as antibiotic 
resistance in eukaryotes. 

In a preferred embodiment, heterologous DNA within the subject vectors is operably 
linked with and under the control of regulatory sequences, such as promoters. The recombinant 

30 vectors of the invention preferably comprises a constitutive or regulatable promoter capable of 
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promoting sufficient levels of expression of the heterologous DNA contained in the viral vector 
in a vertebrate cell. Promoters useful with the subject vectors include, for example, the 
cytomegalovirus (CMV) promoters and the herpes TK gene promoter. The vectors can also 
include other regulatory elements such as introns inserted into the polynucleotide sequence of the 
5 vector. 

The strategy for generation of recombinant viruses is identical to that used for VV virus 
and takes advantage of the high levels of recombination with transfected plasmids mediated by 
these viruses. The basic procedure utilizes transfection of AmEPV-infected cells with an 
appropriately designed shuttle vector. Insertion of foreign genes occurs within a non-essential 

10 gene (e.g., spheroidin or TK). Because of the cytoplasmic nature of AmEPV, it is necessary to 
place all foreign genes under control of an AmEPV (early or late) poxvirus promoter. 
Recombinants are selected and subjected to three rounds of plaque purification before use. 

The subject invention also concerns cells containing recombinant vectors of the present 
invention. The cells can be, for example, vertebrate cells such as mammalian cells. Preferably, 

1 5 the cells are human cells. Cell lines infected or transformed with the recombinant vectors of the 
present invention are also within the scope of the invention. 

The recombinant vectors of the present invention can be introduced into suitable cells or 
cell lines by methods known in the art. If the recombinant vectors are packaged in viral particles 
then cells or cell lines can be infected with the virus containing the recombinant vector. Methods 

20 contemplated for introducing recombinant vector into cells or cell lines also include transfection, 
transduction and injection. For example, vectors can be introduced into cells using liposomes 
containing the subject recombinant vectors. Recombinant viral particles and vectors of the 
present invention can be introduced into cells by in vitro or in vivo means. 

Infection of vertebrate cells is non-permissive, in that early but not late AmEPV gene 

25 expression occurs (Li, Y., R.L. Hall, R.W. Moyer [1997] "Transient, nonlethal expression of 
genes in vertebrate cells by recombinant entomopoxviruses" J. Virol. 71:9557-9562). 
Specifically, if a reporter gene, such as lacZ y is driven by a late poxvirus promoter, such as either 
the AmEPV spheroidin or cowpox virus ATI (A-type Inclusion) promoter, no expression of 
galactosidase is observed. If, however, the lacZ is driven instead by either of two early EPV 

30 promoters (the Melolontha melolontha EPV fusolin gene promoter (Gauthier, L., F. Coussrans, 
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J.C Veyrunes, M. Bergoin [1995] "The Melolontha melolontha entomopoxvirus (MmEPV) 
fiisolin is related to the fiisolins of lepidoptera EPVs and to the 37 K baculovirus glycoprotein" 
Virology 208:427-436) or the 42 kDa early AmEPV protein (Li et al [1997] supra), high levels 
of galactosidase in the recombinant AmEPV infected vertebrate cells are observed. These results 
5 provide clear evidence of AmEPV entry into vertebrate cells followed by early, but not late, viral 
gene expression. 

It has also been found that vertebrate cells survive infection by AmEPV. If CV-1 cells 
are infected with an AmEPV recombinant which contains the green fluorescent protein (GFP) 
gene regulated by the 42 kDa AmEPV early promoter (also called the esp promoter), single, 

10 fluorescent cells are initially observed which then proceed to grow and divide, ultimately 
forming small clusters of fluorescent cells. Therefore, AmEPV enters vertebrate cells, to 
produce a non-permissive, abortive infection, early viral genes are expressed and infected cells 
appear to survive and continue to divide. These properties plus a very large capacity of the virus 
for foreign genes make AmEPV an excellent vector for delivery of genes for expression in a 

15 transient fashion. 

However, in addition to the ability to express foreign genes in a transient fashion, it has 
been found that AmEPV vectors of the subject invention have the ability to stably transform cells 
and express genes in a long term fashion as well. The data presented within the Examples (e.g. , 
Example 2) and accompanying Figures (e.g., Figure 2) confirm that the AmEPV vectors of the 

20 subject invention can be used to deliver DNA which subsequently integrates into DNA of the 
mammalian cell nucleus. The ability of AmEPV to deliver DNA to mammalian cells creates 
endless opportunity for use of the vector in the stable transformation and engineering of 
vertebrate cells. 

The Examples describe methodology for growth, titration and preparation of recombinant 
25 AmEPV, as well as transient expression of AmEPV in vertebrate cells, the use of AmEPV to 
stably transform mammalian cells, and potential uses of AmEPV vectors. 

In addition to entomopoxviruses (EPVs) as novel recombinant vectors, and methods of 
their use, the subject invention provides the nucleotide sequence of the entire genome of the 
genus B entomopoxvirus from Amsacta moorei (AmEPV). This enhances the value of AmEPV 
30 as a vector, and particularly as a gene therapy vector, in a number of ways. For example, specific 
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knowledge of the AmEPV genome facilitates the identification of additional sites which maybe 
used as insertion sites for foreign genes of interest. 

In addition, the subject invention concerns novel AmEVP polypeptides and the 
polynucleotide sequences which encode these polypeptides. The AmEPV polynucleotide 
5 sequences of the subject invention include polynucleotides encoding a triacylglyceride lipase 
(SEQ ED NO: 1), a Cu^/Zn^ superoxide dismutase (SOD) (SEQ ID NO: 2), a CPD photolyase 
(SEQ ID NO: 3), a baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 4), two poly(A) 
polymerase small subunits (SEQ ID NOS: 5 and 6), two DNA polymerases (SEQ ID NOS: 7 and 
8), an ABC transporter-like protein (SEQ ID NO: 9), a Kunitz-motif protease inhibitor (KPI) 
10 (SEQ ID NO: 10), and a poly(A) polymerase large subunit (SEQ ID NO: 11) and other 
polynucleotides. 

In addition, the subject invention concerns isolated AmEPV polypeptides encoded by the 
polynucleotide sequences of the subject invention, including a triacylglyceride lipase (SEQ ID 
NO: 12), a Cu^/Zn^ superoxide dismutase (SOD) (SEQ ID NO: 13), a CPD photolyase (SEQ 
15 ID NO: 14), a baculovirus-like inhibitor of apoptosis (IAP) (SEQ ID NO: 15), two poly(A) 
polymerase small subunits (SEQ ID NOS : 1 6 and 1 7), two DNA polymerases (SEQ ID NOS : 1 8 
and 19), an ABC transporter-like protein (SEQ ID NO: 20), a Kunitz-motif protease inhibitor 
(KPI) (SEQ ID NO: 21), and a poly(A) polymerase large subunit (SEQ ID NO: 22) and other 
AmEPV polypeptides. 

20 The subject invention includes other AmEPV sequences, as described in Table 1, for 

example. In addition, the subject invention includes polynucleotides which hybridize with other 
polynucleotides of the subject invention. 

The genome of the genus B entomopoxvirus from Amsacta moorei (AmEPV) (SEQ ID 
NOS: 23-27) was sequenced and found to contain 232,392 bases with 279 unique open reading 

25 frames (ORFs) of greater than 60 amino acids. The central core of the viral chromosome is 
flanked by 9.4kbp inverted terminal repeats (ITRs), each of which contain 13 ORFs, raising the 
total number of ORFs within the viral chromosome to 292. Default E (EXPECT) values of 
<0.01 were used to define homology to sequences in current databases. ORFs lacking homology 
to other poxvirus genes were shown to comprise 33.6% of the viral genome. Approximately 

30 28.6% of the AmEPV genome (52 AmEPV ORFs) encodes homologues of the mammalian 
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poxvirus co-linear core genes, which are found dispersed throughout the AmEPV chromosome. 
There is also no significant gene order conservation between AmEPV and the orthopteran genus 
B poxvirus of Melanoplus sanguinipes (MsEPV). Novel AmEPV genes include those encoding 
an ABC transporter and a Kunitz motif protease inhibitor. The most unusual feature of the 
5 AmEPV genome relates to the viral encoded poly(A) polymerase. In all other poxviruses this 
heterodimeric enzyme consists of a single large and small subunit. However, AmEPV appears to 
encode one large and two distinct small poly (A) polymerase subunits. AmEPV is one of the few 
entomopoxviruses which can be grown and manipulated in cell culture. 

It is commonly observed in poxvirus genomes that the ORFs situated near the ends of the 

10 genome are preferentially transcribed towards the closest termini (Upton, C, Macen, J. L., 
Maranchuk, R. A., Delange, A. M, and McFadden, G. [1988] "Tumorigenic poxviruses: fine 
analysis of the recombination junctions in malignant rabbit fibroma virus, a recombinant between 
Shope fibroma virus and myxoma virus" Virology 166, 229-239; Upton, C. and McFadden, G. 
[ 1 986] "DNA sequence homology between the terminal inverted repeats of Shope fibroma virus 

1 5 and an endogenous cellular plasmid species" Mol Cell Biol 6 , 265-276). However, as can been 
seen in Figure 8, this is not the case with the ORFs of AmEPV, where no discernable pattern of 
transcription of genes near the termini can be observed. 

Previous estimates placed the A+T content of the AmEPV genome at 81 .5% (Langridge, 
W. H. R. [1983] "Partial Characterization of DNA from Five Entomopoxviruses" J. 

20 Invertebr.Path 42, 369-375). A recalculation based upon the known sequence has raised this to 
82.2%, making AmEPV the most A+T rich poxvirus sequenced to date. In this regard, it is 
interesting to note that 62% of all encoded amino acids are either He, Leu, Phe, Asn or Lys, in 
descending order of frequency, which are comprised mainly of A+T rich codons. The unusually 
high A+T content may also be reflected in the large number of translational stop codons (8.9% of 

25 coding capacity) and relatively few methionine encoding triplets (1.6%). 

Table 1 lists all the ORFs encoded by the AmEPV genome, and functions assigned to the 
encoded proteins. Default E (EXPECT) values of <0.01 were used to define homology to 
sequences in current databases. 52 AmEPV ORFs (28.6% of the genome) show homology to 
ChPV genes, and 91 ORFs (31.3% of the genome) have homologs in EPVs or other insect 

30 viruses. The terminal regions of AmEPV contain few genes homologous to any other gene. 
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Figure 8 illustrates this phenomenon, as well as the observation that AmEPV homologs of both 
vaccinia and MsEPV genes (which we have used as available examples for the ChPV and EPV) 
are positioned more towards the centre of the AmEPV genome. In contrast, novel AmEPVgenes 
are easily identified as occurring more often towards the genomic termini. 



Table 1 . Predicted ORFs of the AmEPV genome. 
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(see materials and methods), c U = genes not found in other poxviruses; E = genes found in other entomopoxviruses; C = genes found in 
chordopoxviruses, f promoter type E = early; L= late, promoters with ambiguous motifs designated by ?. hp = hypothetical protein. * designates an 
E score too low to be quantified. Vaccinia homologs are shown in parentheses, MsEPV homologs not already listed are in brackets, braces contain 
formerly named AmEPV genes. 



Vertebrate poxviruses have been shown to generally share a co-linear arrangement of core 
genes (Goebel, S. J., Johnson, G. P., Perkus, M. E., Davis, S. W., Winslow, J. P., andPaoletti, E. 
[1990] "The complete DNA sequence of vaccinia virus" Virology 179, 247-66, 517-63; 
Massung, R. F., Liu, L. L, Qi, J., Knight, J. C, Yuran, T. E., Kerlavage, A. R., Parsons, J. M., 
Venter, J. C, and Esposito, J. J. [1994] "Analysis of the complete genome of smallpox variola 
major virus strain Bangladesh- 1975" Virology 201 :215-240; Senkevich, T. G., Koonin, E. V., 
Bugert, J. J., Darai, G., and Moss, B. [1997] "The genome of molluscum contagiosum virus: 
Analysis and comparison with other poxviruses" Virology 233: 19-42; Afonso, C. L., Tulman, E. 
R., Lu, Z., Zsak, L., Kutish, G. F., and Rock, D. L. [2000] "The Genome of Fowlpox Virus"/. 
Virol 74:3815-3831). Sequence information from a number of EPVs suggested that this co- 
linear arrangement of core genes is not conserved in members of the EPV subfamily (Hall, R. L. 
and Moyer, R. W. [1993] "Identification of an Amsacta spheroidin-like protein within the 
occlusion bodies of Choristoneura entomopoxviruses" Virology 192:179-187; Sriskantha, A., 
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Osborne, R. J., and Dall, D. J. [1997] "Mapping of the Heliothis armigera entomopoxvirus 
(HaEPV) genome, and analysis of genes encoding the HaEPV spheroidin and nucleoside 
triphosphate phosphohydrolase I proteins" J Gen Virol 78:31 15-3123; Afonso, C. L., Tulman, E. 
R., Lu, Z., Oma, E., Kutish, G. F., and Rock, D. L. [1999] "The genome of Melanoplus 
5 sanguinipes Entomopoxvirus" 1 Virol 73, 533-552). The complete genomic sequence of 
AmEPV enables us to unequivocally confirm this, but also shows there is no conserved co-linear 
core between viruses of genus B. Figure 9 graphically illustrates the absence of any type of 
shared spatial gene arrangement between a typical ChPV (VV), MsEPV and the genome of 
AmEPV. Note that flipping the AmEPV genome direction from 3' to 5 ? does not lessen the 
1 0 degree of gene shuffling which has occurred within these different viruses. 

Promoter consensus sequences 

AmEPV contains promoter elements which govern gene expression. 133 AmEPV genes 
are considered to be early, or potentially early. 158 genes possess motifs which result in late, or 

1 5 potentially late promoters. Only 1 5 genes from the entire 279 gene genome have no recognizable 
promoter or regulatory elements. Genes that contain the sequences TGAAAXXXXA or 
TGAATXXXXA within 1 00 bases of their translational start codons were considered early (E) or 
potentially early (E?), respectively (Table 1). This motif resembles the ChPV early promoter 
core consensus sequence (Moss, B. [1996] Poxviridae: The viruses and their replication. In 

20 "Fields Virology" (B. N. Fields, D. M. Knipe, and P. M. Howley, Eds.), Vol. 2, pp. 2637-2672, 
Lippincott-Reven, Philadelphia; Senkevich, T. G., Koonin, E. V., Bugert, J. J., Darai, G., and 
Moss, B. [1997] "The genome of molluscum contagiosum virus: Analysis and comparison with 
other poxviruses" Virology 233:19-42) and was also used to predict early genes of MsEPV 
(Afonso, C.L., E.R. Tulman, Z. Lu, E. Oma, G.F. Kutish, and D.L. Rock [1999] "The genome of 

25 Melanoplus sanguinipes Entomopoxvirus" J. Virol. 73:533-552). These motifs have been found 
upsteam of known EPV early genes such as the TK gene (Gruidl, M. E., Hall, R. L., and Moyer, 
R. W. [1992] "Mapping and molecular characterization of a functional thymidine kinase from 
Amsacta moorei entomopoxvirus" Virology 186:507-516; Lytvyn, V., Fortin, Y., Banville, M., 
Arif, B., and Richardson, C. [1992] "Comparison of the thymidine kinase genes from three 

30 entomopoxviruses" J. Gen. Virol 73:3235-3240), CbEPV DNA polymerase (Mustafa, A. and 
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Yuen, L. [1991] "Identification and sequencing of the Choristoneura biennis entomopoxvirus 
DNA polymerase gene" DNA Seq. 2:39-45), and the MmEPV fusolin gene (Li, Y., Hall, R. L., 
and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate cells by 
recombinant entomopoxviruses" J. Virol 71 :9557-9562; Gauthier, L., Cousserans, F., Veyrunes, 
5 J. C, and Bergoin, M. [ 1 995] "The Melolontha melolontha entomopoxvirus (MmEPV) fiisolin is 
related to the fusolins of lepidopteran EPVs and to the 37K baculo virus glycoprotein" Virology 
208:427-436). Out of the 36 early MsEPV gene homologs in AmEPV, 27 contain predicted early 
promoters. For comparison with vertebrate poxvirus homologs, we adopted the most recently 
published mammalian poxvirus promoter sequence predictions (Cameron, C, Hota-Mitchell, S., 

10 Chen, L., Barrett, J., Cao, J. X., Macaulay, C, Wilier, D., Evans, D., and McFadden, G. [1999] 
"The complete DNA sequence of myxoma virus" Virology 264:298-318; Wilier, D. O., 
McFadden, G., and Evans, D. H. [1999] "The complete genome sequence of shope (Rabbit) 
fibroma virus" Virology 264:319-343), which emanated from the earlier work of Davison and 
Moss (Davison, A. J. and Moss, B. [1989] "Structure of vaccinia virus late promoters" J. Mol. 

15 Biol 210:771-784). Out of the 1 6 early MYX homologs in AmEPV, 11 contained early promoter 
elements. As a first approximation, candidate genes were considered early only if they contained 
the vaccinia virus early transcription termination sequence (TTTTTNT) near the 3' end of the 
gene; (Gruidl, M. E., Hall, R. L., and Moyer, R. W. [1992] "Mapping and molecular 
characterization of a functional thymidine kinase from Amsacta moorei entomopoxvirus" 

20 Virology 186:507-516; Li, X., Barrett, J. W., Yuen, L., and Arif, B. M. [1997] "Cloning, 
sequencing and transcriptional analysis of the Choristoneura fumiferana entomopoxvirus 
spheroidin gene" Virus Res. 47:143-154; Sriskantha, A., Osborne, R. J., and Dall, D. J. [1997] 
"Mapping of the Heliothis armigera entomopoxvirus (HaEPV) genome, and analysis of genes 
encoding the HaEPV spheroidin and nucleoside triphosphate phosphohydrolase I proteins"/ Gen 

25 Virol 78:3115-3123; Yuen, L. and Moss, B. [1987] "Oligonucleotide sequence signaling 
transcriptional termination of vaccinia virus early genes" Proc. Natl Acad. Sci. U.S.A. 84:641 7- 
6421). There are examples of early genes which contain the TTTTTNT motif towards the 5' N- 
terminal portion of the ORF such as the DNA polymerase of myxoma (Cameron, C, Hota- 
Mitchell, S., Chen, L., Barrett, J., Cao, J. X., Macaulay, C, Wilier, D., Evans, D., and 

30 McFadden, G. [1999] "The complete DNA sequence of myxoma virus" Virology 264:298-3 18) 
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and Shope fibroma (Wilier, D. O., McFadden, G., and Evans, D. H. [1999] "The complete 
genome sequence of shope (Rabbit) fibroma virus" Virology 264:3 1 9-343) viruses. Therefore, 
our estimates of early genes may be low. This sequence is found within 1 00 bases 3' of the ORF 
of 42 of the 1 16 predicted early AmEPV genes. 
5 AmEPV ORFs that contained the sequence TAAATG at the translational start site were 

considered late genes (L) (Bertholet, C, Stocco, P., Van Meir, E., and Wittek, R. [1986] 
"Functional analysis of the 5 ' flanking sequence of a vaccinia virus late gene" EMBO J. 5:1951- 
1957; Rosel, J. and Moss, B. [1985] "Transcriptional and translational mapping and nucleotide 
sequence analysis of a vaccinia virus gene encoding the precursor of the major core polypeptide 

10 4b" J. Virol 56:830-838; Weir, J. P. and Moss, B. [1984] "Regulation of expression and 
nucleotide sequence of a late vaccinia virus gene" J. Virol 51 :662-669; Davison, A. J. and Moss, 
B. [1989] "Structure of vaccinia virus late promoters" /. Mol Biol 210:771-784). This late 
promoter consensus sequence has been observed in other EP V late genes such as spheroidin, the 
nucleoside triphosphatase (NTPase), and hydrolase I (NPH-I), and topoisomerase (Hall, R. L., Li, 

15 Y., Feller, J. A., and Moyer, R. W. [1996] "The Amsacta moorei entomopoxvirus spheroidin 
gene is improperly transcribed in vertebrate poxviruses" Virology 224:427-436; Hall, R. L. and 
Moyer, R. W. [1991] "Identification, cloning, and sequencing of a fragment of Amsacta moorei 
entomopoxvirus DNA containing the spheroidin gene and three vaccinia virus-related open 
reading frames"/. Virol 65:6516-6527; Li, X., Barrett, J. W., Yuen, L., and Arif, B. M. [1997] 

20 "Cloning, sequencing and transcriptional analysis of the Choristoneura fumiferana 
entomopoxvirus spheroidin gene" Virus Res. 47: 143-1 54; Sanz, P., Veyrunes, J. C, Cousserans, 
F., and Bergoin, M. [1994] "Cloning and sequencing of the spherulin gene, the occlusion body 
major polypeptide of the Melolontha melolontha entomopoxvirus (MmEPV)" Virology 202:449- 
457; Sriskantha, A., Osborne, R. J., and Dall, D. J. [1997] "Mapping of the Heliothis armigera 

25 entomopoxvirus (HaEPV) genome, and analysis of genes encoding the HaEPV spheroidin and 
nucleoside triphosphate phosphohydrolase I proteins" J Gen Virol 78:311 5-3 123). Genes that 
contained the sequences TAAAT or TAAAAT within 1 00 bases upstream from their start codon 
were also potentially considered late genes (L?) (Table 1). These sequences have been found 
upstream of other late vertebrate poxvirus genes (Roseman, N. A. and Hruby, D. E. [1987] 
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"Nucleotide sequence and transcript organization of a region of the vaccinia virus genome which 
encodes a constitutively expressed gene required for DNA replication"/. Virol 61 : 1398-1406). 



Terminal regions 

5 AmEP V is one of the few entompox viruses which can be easily and reliably replicated in 

tissue culture (Winter, J., Hall, R. L., and Moyer, R. W. [1995] "The effect of inhibitors on the 
growth of the entomopoxvirus from Amsacta moorei in Lymantria dispar (gypsy moth) cells" 
Virology 211: 462-473; Hall, R. L., Li, Y., Feller, J. A., and Moyer, R. W. [1 996] "The Amsacta 
moorei entomopoxvirus spheroidin gene is improperly transcribed in vertebrate poxviruses" 

10 Virology 224:427-436). Because of this, we were able to obtain DNA for sequencing from a 
single clonal virus plaque, thus minimizing template heterogeneity. The results of sequencing 
from non-clonally isolated template DNA can be seen in the resultant sequence of MsEPV, 
where the two inverted terminal repeat (ITR) regions are not identical (Afonso, C. L., Tulman, E. 
R., Lu, Z., Oma, E., Kutish, G. F., and Rock, D. L. [1999] "The genome of Melanoplus 

15 sanguinipes Entomopoxvirus"/. Virol 73, 533-552). 

Poxvirus ITRs can vary considerably in size. The smallest ITRs are those of variola 
Bangladesh which are only 725 bp (Massung, R. F., Esposito, J. J., Liu, L. L, Qi, J., Utterback, T. 
R, Knight, J. C, Aubin, L., Yuran, T. E., Parsons, J. M., Loparev, V. N., Selivanov, N. A., 
Cavallaro, K. F., Kerlavage, A. R., Mahy, B. W. J., and Venter, J. C. [1993] "Potential virulence 

20 determinants in terminal regions of variola smallpox virus genome" Nature 366:748-751; 
Massung, R. F., Liu, L. I., Qi, J., Knight, J. C, Yuran, T. E., Kerlavage, A. R., Parsons, J. M., 
Venter, J. C, and Esposito, J. J. [1994] "Analysis of the complete genome of smallpox variola 
major virus strain Bangladesh- 1975" Virology 201:215-240). The AmEPV genome contains 
identical ITR tandem repeats of 9.4 kbp at both termini which are organized in a fashion similar 

25 to that of other poxviruses; i.e. a series of tandemly repeated sequences interspersed with non- 
repetitive spacer region (Figure 1) (Massung, R. F., Knight, J. C, and Esposito, J. J. [1995] 
"Topography of variola smallpox virus inverted terminal repeats" Virology 21 1 :350-355; Wittek, 
R., Menna, A., Muller, H. K., Schumperli, D., Boseley, P. G., and Wyler, R. [1978] "Inverted 
terminal repeats in rabbit poxvirus and vaccinia virus DNA' V. Virol 28:171-181;Upton,C. and 

30 McFadden, G. [1986] "DNA sequence homology between the terminal inverted repeats of Shope 
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♦fibroma virus and an endogenous cellular plasmid species" Mol Cell Biol 6 , 265-276; Afonso, 
C. L., Tulman, E. R., Lu, Z., Oma, E., Kutish, G. F., and Rock, D. L. [1999] "The genome of 
Melanoplus sanguinipes Entomopoxvirus" J. Virol 73,533-552). Myxoma virus and AmEPV 
share a similar ITR structure, in that the ORFs encoded in this region extend to the very ends of 
5 the genome termini, and contain very little non-coding DNA (Cameron, C, Hota-Mitchell, S., 
Chen, L., Barrett, J., Cao, J. X., Macaulay, C, Wilier, D., Evans, D., and McFadden, G. [1999] 
"The complete DNA sequence of myxoma virus" Virology 264:298-318). Other sequenced 
poxvirus ITRs contain smaller numbers of genes interspersed with large regions of non-coding 
DNA within them. Other examples of poxvirus ITRs include MsEPV, which contains 3kb of 

1 0 non-coding DNA within the ITRs (Afonso, C. L., Tulman, E. R., Lu, Z., Oma, E., Kutish, G. F., 
and Rock, D. L. [1999] "The genome of Melanoplus sanguinipes Entomopoxvirus"./. Virol 73, 
533-552), 6kb in VV Ankara strain (Antoine, G., Scheiflinger, F., Dorner, F., and Falkner, F. G. 
[1998] "The complete genomic sequence of the modified vaccinia Ankara strain: Comparison 
with other orthopoxviruses" Virology 244:365-396) and 3kb in MCV (Senkevich, T. G., Koonin, 

15 E. V., Bugert, J. J., Darai, G., and Moss, B. [1997] "The genome of molluscum contagiosum 
virus: Analysis and comparison with other poxviruses" Virology 233 : 1 9-42). As seen in Table 1 , 
each of the ORFs within the AmEPV ITR encodes a protein with no BLAST-derived function. 
The exception to this is AMVITR1 . This most terminal gene has homology to MsEPV MSV01 0. 
Although this gene is not within the ITR region of the MsEPV genome, it is located towards the 

20 left terminus. The gene encodes a member of the leucine rich gene family protein. 

Spontaneous DNA arrangements occur with an increased frequency at or near the 
terminal inverted repeat sequences of poxviral genomes (Moyer, R. W., Graves, R. L., and 
Rothe, C. T. [1980] "The white pock (mu) mutants of rabbit poxvirus, in. Terminal DNA 
sequence duplication and transposition in rabbit poxvirus" Cell 22:545-553). Indeed, the 

25 majority of novel and non-essential genes are generally found within poxviral ITRs or toward the 
genomic termini. 

AmEPV and vertebrate poxvirus gene homologs 

The complete genomic sequences of vaccinia and variola viruses from the orthopoxvirus 
30 genus, myxoma and Shope fibroma viruses from the leporipoxvirus genus, fowlpox from the 
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avipoxvirus genus, the molluscipoxvirus molluscum contagiosum and the genus B EPV, MsEPV 
have allowed definition of conserved poxvirus genes present in most, if not all, poxviruses. 
Inclusion of the AmEPV genomic sequence extends that concept. 

Inspection of the AmEPV sequence shows 52 ORFs which have homology to genes 
5 found in ChPV (Table 2). Of these, 44 have been assigned a function. Of the 44 ORFs with an 
assigned function, 18 are derived from proteins involved in mRNA synthesis which include 5 
ORFs comprising an RNA polymerase, 4 ORFs likely to encode transcription factors, 3 ORFs 
related to helicases/NTPases and 5 ORFs devoted to post-transcriptional mRNA modifications. 
Of the ORFs devoted to mRNA modification, the poly(A) polymerase deserves special mention. 

10 Normally, this heterodimeric enzyme consists of a large and small subunit. However, the 
AmEPV sequence reveals the presence of three rather than the expected two potential subunits. 
The ORFs AMV038, AMV060 and AMV1 1 5 are predicted to represent one large and two small 
poly(A) polymerase subunits respectively. This unusual feature will be discussed in a 
subsequent section. There are 7 homologous ORFs involved in functions of DNA 

1 5 replication/repair, which include a DNA polymerase, photolyase, nucleotide phosphohydrolase, 
DNA topoisomerase and a uracil DNA glycosylase. Interestingly, neither AmEPV nor MsEPV 
encode a homolog of the vaccinia I3L protein. The I3L protein is a DNA binding protein and is 
presumably involved in DNA replication (Davis, R. E. and Mathews, C. K. [1993] "Acidic C 
terminus of vaccinia virus DNA-binding protein interacts with ribonucleotide reductase" Proc. 

20 Natl Acad. Sci. U.S.A. 90:745-749). Ten ChPV/EPV ORFs are associated with conserved virus 
structural proteins. Finally, there are 8 ORFS associated with enzymatic activities not strictly 
related to nucleic acid metabolism. 

Table 2. Chordopoxvirus homologs found within AmEPV. 

25 



AmEPV 
ORF 


Length 

amino 
acids 


MsEPV 
ORF 


Length 

amino 
acids 


W 
ORF 


Length 

amino 
acids 


Gene name 
and/or function 


Transcription/RNA Modification 


RNA polymerase 


AMV051 


349 


149 


.348 


A29L 


305 


RP035 


AMV054 


822 


119 


807 


H4L 


795 


RAP94 
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AmEPV 
ORF 


L ngth 

amino 
acids 


MsEPV 
ORF 


L ngth 

amino 
acids 


W 
ORF 


Length 

amino 
acids 


Gene name 
and/or function 


AMV066 


1196 


155 


1190 


A24R 


1164 


RP0132 


AMV166 


237 


100 


230 


A5R 


164 


RP019 


AMV221 


1301 


43 


1319 


J6R 


1286 


RP0147 


AMV230 


179 


245 


186 


D7R 


161 


RP018 


Transcription Factors 


AMV047 


259 


187 


261 


A1L 


150 


VLTF-2 


AMV091 


356 


52 


345 


A23R 


382 


transcription factor 


AMV105 


767 


63 


760 


A7L 


710 


VETF-L 


AMV174 


670 


113 


674 


D6R 


637 


VETF-s 


AMV205 


228 


65 


218 


A2L 


224 


VLTF-3 


NTPase/helicase 


AMV059 


469 


148 


471 


A18R 


493 


DNA helicase 


AMV081 


720 


86 


717 


I8R 


676 


RNA helicase 


AMV192 


648 


53 


| 647 


D11L 


631 


NPH-I 


mRNA modification 


AMV038 


573 


143 


571 


E1L 


479 


PAP-L 


AMV060 


295 


41 


293 


J3R 


333 


PAP-S 


AMV093 


262 


124 


267 


D12L 


287 


mRNA capping small subunit 


AMV115 


293 


41 


295 


J3R 


333 


PAP-S 


AMV135 


627 


67 


860 


D1R 


844 


mRNA capping large subunit 


DNA replication/repair 


AMV016 


182 


N/A 


N/A 


J2R 


177 


Thymidine Kinase 


AMV025 


453 i 


235 ! 


466 


S127L* 


445 


CPD photolyase 


AMV050 


1105 


36 


964 


E9L 


1006 


DNA polymerase 


AMV052 


333 


130 


328 


H6R 


314 


DNA topoisomerase 


AMV058 


276 


150 


289 


D10R 


248 


NTP pyrophosphohydorlase/ 
mutT 


AMV087 


726 


89 


834 


D5R 


785 


NTPase 


AMV231 


344 


208 


232 


D4R 


218 


uracil DNA glycosylase UNG 


Structural 


AMV035 


336 


121 


333 


G9R 


340 


membrane protein 


AMV061 


255 


158 


293 


L4R 


251 


30K virion protein 
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MlDtrV 

ORF 


lc n gin 

amino 
acids 


mscr v 
ORF 


1 nnfh 

amino 
acids 


WW 
V V 

ORF 


1 nnth 

l. ngin 
amino 
acids 


Con n amo 
veil iidim? 

and/or function 


AMV118 


386 


90 


380 


A16L 


378 


membrane protein 


AMV122 


567 


69 


584 


D13L 


551 


rifampicin resistance gene 


AMV139 


1149 


152 


1306 


A10L 


891 


P4a core protein 


AMV147 


668 


164 


648 


A3L 


644 


P4b core protein 


AMV181 


464 


189 


464 


I7L 


423 


core protein 


AMV217 


246 


183 


242 


L1R 


250 


myristylated membrane protein 


AMV232 


140 


142 


139 


J5L 


133 


membrane protein 


AMV243 


248 


94 


241 


F9L 


212 


membrane protein 


Enzymes 


AMV078 


165 


N/A 


N/A 


S069L* 


173 


protein tyrosine phosphatase 


AMV114 


105 


93 


107 


E10R 


95 


put. redox 


AMV133 


287 


48 


288 


M5L** 


75 


lipase 


AMV150 


240 


171 


244 


A32L 


300 


ATP/GTP binding protein 


AMV153 


468 


173 


457 


F10L 


439 


Ser/Thr protein kinase 


AMV197 


299 


154 


396 


B1R 


300 


Ser/Thr protein kinase 


AMV256 


609 


56 


629 


G1L 


591 


metalloprotease 


AMV255 


152 


N/A 


N/A 


A45R 


163 


Cu-Zn superoxide dismutase 


Others 


AMV041 


213 


39 


193 


G6R 


165 




AMV069 


348 


180 


343 


L3L 


350 




AMV127 


195 


60 


194 


H2R 


189 




AMV138 


320 


151 


313 


A11R 


318 




AMV162 


163 


106 


163 


A22R 


176 




AMV179 


416 


115 


505 


GSR 


434 




AMV186 


161 


132 


161 


A28L 


146 




AMV249 


112 


209 


113 


A21L 


117 





All ChPV homolog ORF's shown are from VV. Where no homolog exists, *SFV, and **CPV. 

Several of the genes included in Table 2 showing AmEPV vertebrate poxvirus homologs 
are not universally conserved, but are nevertheless present in many poxviruses. One example is 
5 the thymidine kinase (TK) gene. AmEPV encodes a TK gene, as do most ChPV and most other 
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genus B EPVs investigated to date (Lytvyn, V. ? Fortin, Y., Banville, M., Arif, B., and 
Richardson, C. [1992] "Comparison of the thymidine kinase genes from three 
entomopox viruses"/ Gen. Virol 73:3235-3240). However, the gene is noticeably absent from 
both molluscum contagiosum and MsEPV (Senkevich, T. G., Koonin, E. V., Bugert, J. J., Darai, 
5 G., and Moss, B. [1997] "The genome of molluscum contagiosum virus: Analysis and 
comparison with other poxviruses" Virology 233:19-42; Afonso, C. L., Tulman, E. R., Lu, Z., 
Oma, E., Kutish, G. F., and Rock, D. L. [1999] "The genome of Melanoplus sanguinipes 
Entomopoxvirus'V. Virol 73,533-552). Perhaps as previously suggested in the case of MsEPV, 
the absence of a TK and other enzymes related to nucleotide biosynthesis is reflective of a 

1 0 differential dependence on host biosynthetic pathways (Afonso et al [ 1 999] supra). Similarly, 
the CPD photolyase is not universally conserved within all members of the poxvirus family, but 
is present in a number of different viruses. Also of note is MsEPV ORF237, which is 
homologous to vaccinia virus B2R (Afonso et al [1999] supra; Goebel, S. J., Johnson, G. P., 
Perkus, M. E., Davis, S. W., Winslow, J. P., and Paoletti, E. [1990] "The complete DNA 

1 5 sequence of vaccinia virus" Virology 1 79, 247-66, 5 1 7-63). This ORF, found at the right termini 
of both viruses, is absent from the genome of AmEPV. Likewise, a Cu/Zn superoxide dismutase 
(SOD) found within AmEPV (AMV255) is absent from the genome of MsEPV, and is 
fragmented or partially deleted in many orthopoxviruses (Smith, G. L., Chan, Y. S., and Howard, 
S. T. [1991] "Nucleotide sequence of 42 kbp of vaccinia virus strain WR from near the right 

20 inverted terminal repeat" J. Gen. Virol 72:1349-1376;Cameron,C.,Hota-Mitchell,S.,Chen,L., 
Barrett, J., Cao, J. X., Macaulay, C, Wilier, D., Evans, D., and McFadden, G. [1999] "The 
complete DNA sequence of myxoma virus" Virology 264:298-318). Both EPVs also encode a 
homolog of the A2 1 L protein of VV. The A2 1 L protein has been shown to interact with the A6L 
protein using the two hybrid system (McCraith, S., Holtzman, T., Moss, B., and Fields, S. [2000] 

25 "Genome-wide analysis of vaccinia virus protein-protein interactions" Proc. Natl Acad. Set 
U.S.A 97:4879-4884). Interestingly, there is no homolog of the A6L protein in either EPV. 
Therefore, with the exception of the TK, SOD, protein tyrosine phosphatase and VV B2R, 
AmEPV and MsEPV share the same suite of ChPV virus homologs. 

30 
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A comparison of ORF content between AmEPV and other EPVs 

As well as the core poxviral genes shared between ChPV and EPV shown in Table 2, 
there are a number of genes which are shared between sequenced entomopoxviruses ie. AmEPV 
and MsEPV. Limited sequence data is also available from various regions of other 
5 entomopoxviruses currently under investigation. Given the vastly differing host requirements of 
the ChPVs and EPVs, it is not unexpected that many genes differ between the two subfamilies. 
Approximately one third of genes encoded by ChPVs are responsible for a response against host 
immune defense systems (Gooding, L. R. [1992] "Virus proteins that counteract host immune 
defenses" Cell 71 :5-7; Smith, G. L. [1994] "Virus strategies for evasion of the host response to 
10 infection" Trends in Microbiol 2:81-88; Smith, G. L. [2000] "Secreted poxvirus proteins that 
interact with the immune system" Effects of Microbes on the Immune System 49 1 -507). The 69 
genes shared between MsEPV and AmEPV (but absent in ChPVs) are likely involved in insect 
specific interactions. The pattern of gene organization within the genome of Genus B EPVs has 
long been realized to be distinct from those of the ChPVs (Hall, R. L. and Moyer, R. W. [1991] 
1 5 "Identification, cloning, and sequencing of a fragment of Amsacta moorei entomopoxvirus DNA 
containing the spheroidin gene and three vaccinia virus-related open reading frames" J. Virol 
65:6516-6527; Sriskantha et al [1997] supra; Afonso et al [1999] supra). However it is now 
also evident that within genus B, obvious reorganization has occurred. For example, the NPH-1 
and spheroidin homologs are immediately adjacent in all other known genus B viruses from 
20 Choristoneura and Heliothis, but are separated by 20kb in MsEPV. Similarly, the juxtaposed 
A23R protein and NPH-1 homo log in MsEPV are separated by 78kb in the AmEPV genome. 
Although there are no areas of organizational identity between the MsEPV and AmEPV 
genomes, there is one region of AmEPV genes (AMV159-AMV164) which contains homologs 
to MsEPV genes in the order of MSV1 1 1, 110, 108, 106, 1 12, 107. Given the lack of spatial 
25 conservation and degree of gene shuffling between MsEPV and AmEPV genes in all other areas 
of the AmEPV genome, small groups of genes may be present as the last remnants of divergence 
from a common ancestor. Alternatively, small clusters might have remained in close proximity 
to each other due to a more recent acquisition or for functional or regulatory reasons. Albeit not 
as striking an example, the homologs of MsEPV genes MSV085-MSV089 are also non- 
30 sequentially grouped within a 9 gene assembly within AmEPV (AMV079-AMV087). For these 
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reasons, it is likely a conserved colinear core of genes may only be shared among the 
lepidopteran viruses within EPV genus B (Afonso et al. [1999] supra). 

In this regard, the comparative alignment of two lepidopteran group B viruses, AmEPV 
and published HaEPV genes, does reveal some organizational similarities. Certain co-linear 
5 regions do appear to be shared. Positions of the spheroidin, NPH-1, "Q3" and DNA polymerases 
are all similarly situated within the genomes of the two viruses. The juxtaposed HaEPV PAP2, 
30K and ORF4 genes are also immediately adjacent and co-linear in AmEPV, with ORF 
direction preserved (AMV060, AMV061 and AMV062)(Crnov and Dall 1999). Comparative 
alignments have also highlighted differences between these two more closely related genus B 

10 lepidopteran EPVs. For example, the large RNA polymerase of HaEPV is located toward the 
leftmost end of the genome, whereas it is positioned at the right end of AmEPV. Likewise the 
"17K" ORF of HaEPV is duplicated and teminally located within its ITRs, but homologous 
regions within AmEPV are not repeated, and are positioned approximately one hundred genes 
from the genomic termini. Whether or not a generally co-linear arrangement of genes emerges 

1 5 for the lepidopteran EPVs, it is obvious that EPVs in general have not followed the evolutionary 
direction of ChPV which has enabled them to retain a common co-linear gene core. 

Clearly, genes shared between AmEPV and MsEPV are not arranged in a co-linear 
fashion and based on overall gene organization, MsEPV and AmEPV may be far more distantly 
related than the current common morphologically based classification as genus B EPVs would 

20 suggest. There are two possibilities to explain this divergence in gene order between AmEPV 
(lepidopteron) and MsEPV (orthopteran) viruses. One model employs a large evolutionary gap 
between the two viruses. A second model is based on intrinsic genomic plasticity and 
generalized movement of genes within the viral chromosome of EPVs. Comparative homologies 
among essential genes; e.g. RNA polymerase subunits, suggests MsEPV and AmEPV are more 

25 closely related to each other than either is to ChPVs homologs. Therefore, it may well be that 
plasticity or position independent location of genes within EPVs plays a significant role in the 
creation of divergent gene orders. 

AmEPV encodes an additional 27 gene homologs not found within ChPV or MsEPV, but 
which are present in other insect viruses including baculoviruses (AcNPV, XcGV, SeNPV, 

30 CpGV, LdNPV and TnGV) and an iridovirus (Chilo iridescent virus). The majority of these 
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genes have previously been assigned functions, and a number are not specific to insect viruses 
alone (see Table 1). 



AmEPV gene families 

5 MsEPV was found to encode 43 novel ORFs which could be grouped into five gene 

families of varying stringency. Examination of the AmEPV genomic sequence revealed the 
presence of 23 genes which can be grouped into six gene families (Table 3). 



Table 3. AmEPV Gene Families. 



Gene 
family 


AMV ORF 


Size (aa) 


Homology 


AMV716 


AMV056 


86 


none 




AMV176 


86 


none 




AMV178 


86 


none 


ALI-like 


AMV055 


133 


HaEPVORF6/MSV194 




AMV057 


352 


HaEPVORF6/MSV194 




AMV175 


346 


HaEPVORF6/MSV194 




AMV177 


L 360 


HaEPVORF6/MSV194 




AMV257 


125 


MSV196 


MTG-like 


AMV194 


472 


MSV198 




AMV207 


476 


MSV198 




AMV209 


463 


MSV198 


Tryptopha 
n 


AMV029 


290 


MSV027 




AMV254 


215 


MSV027 


17K ORF 


AMV024 


354 


HaEPV 17K ORF/FPV124 




AMV110 


362 


HaEPV17K ORF/FPV124 




AMV112 


348 


HaEPV 17K ORF/FPV124 




AMV100 


136 


HaEPV 17K ORF/FPV248 




AMV132 


207 


HaEPV 17K ORF/FPV248 


LRR 


AMVITR1 


460 


MSV010 




AMV005 


350 


MSV01 1 




AMV014 


486 


MSV240 




AMV076 


117 


MSV255 




AMV134 


535 


MSV240 



10 
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The AMV176 gene family has no homology to any proteins within current databases. 
Each of these 86 residue proteins is identical, except for a single nucleotide substitution in 
AMV056 which results in an isoleucine codon at residue 37, instead of the leucine coded by both 
AMV176 and AMV178. It is unusual to observe perfect copies of genes within a gene family. 
5 All members of the family are predicted to contain a transmembrane domain. 

The five member ALI-like (alanine-leucine-isoleucine) gene family largely comprises 
ORFs related to the AMV 176 gene family discussed above. The ORFs do not possess any 
motifs indicative of transmembrane domains or signal sequences. AMV055, appears to be a 
carboxy terminal truncated member of this family. This 133 residue ORF shares a large number 
10 of residue identities with the other family members. The final member of the family, AMV257, 
appears to be truncated at the N-terminus, and is less related to the other members of this family. 
Nevertheless, its homology to MSV196 warrants its inclusion in this group. 

A third MTG-like gene family has three members; AMV 194, AMV207 and AMV209. 
There is a 69% identity between AMV207 and AMV209. AMV 1 94 is somewhat less related to 
1 5 the other family members. Each gene was identified independantly based on its homology to the 
MTG gene family ORF MSV198 found in MsEPV. However, the invariant signature MTG 
(methionine-threonine-glycine) motif is absent from all AmEPV proteins, and an expected 
internal motif found within the MsEPV proteins was found to be degenerate. 

A fourth family comprising only AMV029 and AMV254 shows homology to MsEPV 
20 ORF MSV027, which is a member of the tryptophan repeat gene family. Both AmEPV ORFs 
contain the expected motifs, although AMV029 does show degeneracy. 

The fifth, 17K ORF gene family, contains five members which do not show any 
homology to MsEPV proteins, but are instead related to the 17K ORF of HaEPV. AMV024, 
AMV1 10 and AMV1 12 show excellent conservation at both their amino and carboxy termini, 
25 with a 60 residue internal portion of lesser similarity. Interestingly, these three genes also show 
homology with the NlR/p28 gene family of FPV (FPV124). AMV100 and AMV132 are also 
homologous to the HaEPV 17K ORF, and to FPV248, but less so. There is no homology 
between these two predicted AmEPV proteins themselves. Fifteen residues are shared between 
all members of this family. 
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The sixth gene family is the LRR (leucine-rich repeat) gene family which contains five 
AmEPV genes based upon the position of a motif containing regularly spaced leucine residues. 
There is a large LRR gene family in MsEPV. Each of the five members of the AmEPV LRR-like 
family shows homology to an LRR gene family protein of MsEPV. AMVITR1 and AMV005 are 
5 63% identical, and very well conserved at their amino terminus. AMV014 and AMV134 share 
regions of homology along their lengths. At 1 1 7 residues, AMV076 is significantly smaller than 
other LRR-like gene family members (varying from 350 to 535 residues). However, when 
aligned with all other family members, an internal conserved motif emerges which includes 
seven leucine or isoleucine residues. 

10 

AmEPV ORFs encoding unique gene products 

The majority of the unique AmEPV genes are located at the terminal extremes of the 
virus genome, as can be easily observed in Figure 7. More than one third of AmEPV ORFs (128 
out of 279) show no homology to any sequences currently in the databases. We have classified 

1 5 these novel ORFs on the basis of whether they contain a predicted transmembrane domain (TM) 
and/or signal peptide (SP). Based on this classification, 4 ORFs possess predicted TM and SP 
domains, 3 an SP only, 56 a TM alone, and 65 possess neither. We expect that like the ChPV, a 
number of AmEPV genes are devoted to overcoming host defense responses. 

Most genes encoded by AmEPV have homologs in ChP Vs, EPVs or other insect viruses. 

20 In addition, there are a number of ORFs unique to AmEPV. There are also ORFs of interest 
potentially involved in host pathogenesis or virulence, such as AMV133 (SEQIDNO: l)which 
encodes a lipase, AMV255 (SEQ ED NO: 2) which encodes a superoxide dismutase (SOD), 
AMV025 (SEQ ID NO: 3) encoding a CPD photolyase and AMV021 (SEQ ID NO: 4) which 
encodes a baculovirus-like inhibitor of apoptosis (LAP). The following paragraphs briefly 

25 discuss each of these genes and their expected interactions with the host immune system. 

AMV133 (SEQ ID NO: 1) encodes a AmEPV triacylglyceride lipase gene which could 
conceivably function as a virulence gene through lipid hydrolysis. AmEPV has been shown to 
launch a promiscuous infection within the insect, including the fat body (Arif, B. M. and 
Kurstak, E. [1991]. The Entomopoxviruses. In "Viruses of Invertebrates," E. Kurstak, Ed.„ pp. 

30 175-195. Marcel Dekker, Inc., New York), which is the major site of lipid storage (Chapman, R. 
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F. [1998] Circulatory system, blood and immune systems. In "The Insects" pp. 94-131, 
Cambridge University Press, Cambridge). Although AmEPV infected insects do not undergo the 
"melting" phenotype associated with baculovirus infection, lipid hydrolysis would be anticipated 
to increase viral virulence, as has also been suggested for the lipase gene of MsEPV (Afonso et 
5 al. [1999] supra). Ectromelia and CPV are the only other poxviruses which encode similar 
proteins, and these are thought to play a role in the viral inflammatory response (Wall, E. M., 
Cao, J. X., Chen, N. H., Buller, R. L., and Upton, C. [1997] "A novel poxvirus gene and its 
human homo log are similar to an E-coli Lysophospho lipase" Virus Res. 52:157-167). 

AmEPV AMV255 (SEQ ID NO: 2) encodes a Cu^/Zn 4 * superoxide dismutase homolog. 

10 These proteins are widespread in nature and are recognized as a primary defense against the 
damage of superoxide radicals (Fridovich, I. [1997] "Superoxide anion radical (02-.), superoxide 
dismutases, and related matters"/. Biol. Chem. 272:18515-18517). Although the SOD homolog 
was initially discovered in a baculovirus (Tomalski, M. D., Eldridge, R., and Miller, L. K. [1991] 
"A baculovirus homolog of a Cu/Zn superoxide dismutase gene" Virology 184, 149-161), all 

15 sequenced ChPVs have also been found to encode a vestige of a SOD. However, many include 
deletions or substitutions within the coding region which render the protein inactive (Smith, G. 
L., Chan, Y. S., and Howard, S. T. [1991] "Nucleotide sequence of 42 kbp of vaccinia virus 
strain WR from near the right inverted terminal repeat" J. Gen. Virol. 72, 1349-1376; Wilier et 
al. [1999] supra) but the AmEPV homolog appears to be intact. MsEPV does not encode a sod 

20 (Afonso et al. [1999] supra). 

During their life cycle, most insect viruses spend some period of time exposed to 
potentially detrimental environmental conditions. Therefore it is somewhat surprising that more 
insect viral genomes do not contain light-dependant DNA-repair mechanisms. AmEPV 
AMV025 (SEQ ID NO: 3) and MsEPV both encode a CPD photolyase homolog, as do the ChPV 

25 SFV and MYX. These are the only reports of virally encoded CPD photolyases. 

Viruses have evolved various strategies to inhibit apoptosis, thereby allowing 
intracellular viral replication. In ChPVs, apoptosis is controlled in part by serpins (Petit, F., 
Bergagnoli, S., Gelfi, J., Fassy, F., Boucraut-Baralon, C, andMilon, A. [1996] "Characterization 
of a myxoma virus-encoded serpin-like protein with activity against interleukin-lb converting 

30 enzyme" J. Virol. 70:5860-5866; Ray, C. A., Black, R. A., Kronheim, S. R., Greenstreet, T. A., 
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Sleath, P. R., Salvesen, G. S., and Pickup, D. J. [1992] "Viral inhibition of inflammation: 
cowpox virus encodes an inhibitor of the interleukin-1 beta converting enzyme" Cell 69:597- 
604; Spriggs, M. K., Hruby, D. E., Maliszeswki, C. R., Pickup, D. J., Sims, J. E., Buller, R. M. 
L., and VanSlyke, J. [1992] "Vaccinia and Cowpox viruses encode a novel secreted interleukin-1 
5 binding protein" Cell 7 1 : 1 45- 1 52; Ray, C. A. and Pickup, D. J. [ 1 996] "The mode of death of pig 
kidney cells infected with cowpox virus is governed by the expression of the crmA gene" 
Virology 217:384-391 ; Macen, J., Takahashi, A., Moon, K. B., Nathaniel, R., Turner, P. C, and 
Moyer, R. W. [1998] "Activation of caspases in pig kidney cells infected with wild-type and 
CrmA/SPI-2 mutants of cowpox and rabbitpox viruses" J. Virol. 72:3524-3533; Turner, P. C. 

10 andMoyer,R. W. [1998] "Control of apoptosis by poxviruses" Seminarsin Virology 8:453-469). 
Insect viruses control apoptosis through either p35 or through a series of inhibitor of apoptosis 
(LAP) proteins (Deveraux, Q. L. and Reed, T. C. [1999] "LAP family proteins - suppressors of 
apoptosis" Genes & Development 13:239-252; Miller, L. K. [1999] "An exegesis of IAPs: 
salvation and surprises from BLR motifs" Trends Cell Biol. 9:323-328; Manji, G. A., Hozak, R. 

15 R., LaCount, D. J., and Friesen, P. D. [ 1 997] "Baculovirus inhibitor of apoptosis functions at or 
upstream of the apoptotic suppressor P35 to prevent programmed cell death" J. Virol 71 :4509- 
4516). AMV021 (SEQ ID NO: 4) encodes one such inhibitor of apoptosis protein (IAP), and 
contains two typical baculovirus IAP repeats and a C-terminal RING finger motif The AMV021 
ORF shows significant identity to the IAP of Cydia pomonella granulosis virus (47%), which has 

20 previously been shown to be functionally active (Crook, N. E., Clem, R. J., and Miller, L. K. 
[1993] "An apoptosis-inhibiting baculovirus gene with a zinc finger-like motif J. Virol 
67:2168-2174). AmEPV and MsEPV are the only poxviruses found to encode IAPs. These 
proteins have only been noted in the genomes of viruses which infect insect or arthropod hosts. 
The following are several different ORFs encoded by AmEPV, which are notable either 

25 because they currently do not have homologs in any published viral sequence to date or possess 
novel aspects of previously, described poxvirus genes. 

AMV060 (SEQ LD NO: 5) and AMV115 (SEQ ID NO: 6) encode a first and second 
AmEPV poly(A) polymerase subunit. ORFs AMV060 and AMV115 present a completely 
unanticipated variation of a well detailed poxvirus encoded enzyme, the poly(A) polymerase. 

30 The cytoplasmic synthesis of poxvirus mRNAs involves not only transcription of a given gene by 
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the viral RNA polymerase, but also post-transcriptional modification of the transcripts, including 
3 f poly(A) addition and 5'capping as well as 2'0-methylation. In the case of VV, addition of 
poly(A) to transcripts and 2 , Omethylation of the mRNAs involves a heterodimeric poly(A) 
polymerase consisting of one large (VPS 5) and one single small (VP39) subunit encoded by two 
5 distinct ORFs (Brakel, C. and Kates, J. R. [1974] "Poly(A) polymerase from vaccinia virus- 
infected cells. I. Partial purification and characterization"/. Virol. 14:715-723; Gershon, P. D., 
Ahn, B. Y., Garfield, M., and Moss, B. [1991] "Poly(A) polymerase and a dissociable 
polyadenylation stimulatory factor encoded by vaccinia virus" Cell 66: 1269-1278; Schnierle, B. 
S., Gershon, P. D., and Moss, B. [1992] "Cap-specific mRNA (nucleoside-02'-)- 

1 0 methyltransferase and poly(A) polymerase stimulatory activities of vaccinia virus are mediated 
by a single protein" Proc. Natl. Acad. Sci. U.S.A. 89:2897-2901). 

The AmEPV genomic sequence has revealed an unusual feature of the poly(A) 
polymerase in this entomopoxvirus. Like other poxviruses, there is a single, large subunit 
(AMV038) (SEQ ID NO: 1 1) of approximately 570 amino acids. This is similar in size to the 

1 5 large VV poly(A) polymerase subunit (VP55). However, unlike any other poxvirus (Afonso et 
al [2000] supra; Afonso et al [1999] supra; Cameron et al. [1999] supra; Wilier et al [1999] 
supra; Senkevich et al [1997] supra; Antoine et al [1998] supra; Goebel et al [1990] supra), 
sequencing suggests that AmEPV may encode two small subunits (AMV060 and AMV115). 
The two small subunits are somewhat smaller than the 333 amino acid VV small subumit (295 

20 and 293 amino acids respectively) (Figure 10) and related throughout their length. 

Comparison of both AMV060 and AMV115 to the small subunit of VV (VP39, ORF 
J3R), MsEPV (MSV041) and the sole poly(A) polymerase subunit revealed in the incompletely 
sequenced genome of HaEPV, is striking (Figure 10). Both AmEPV small subunits show the 
largest degrees of relatedness to other poxvirus poly(A) polymerase small subunits within the 

25 first 200 amino acids. Both AmEPV small subunits contain a highly conserved poly(A) 
polymerase regulatory structural motif encompassing amino acids 1-281 within AMV060 and 
amino acids 8-271 within AMV115. The AMV060 subunit is more related to VP39 than is 
AMV1 15. However, if both the two AmEPV small subunits are both compared to the single 
poly(A) polymerase small subunit of MsEPV (MSV041), the homologies for both AMV60 and 

30 AMV1 1 5 to MSV041 comparable and greater than either of the small AmEPV subunits to VV. 
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BLAST values for AMV060 showed it to be most related to the small poly(A) polymerase 
subunit sequenced from HaEPV (Sriskantha et ah [1997] supra), while AMV115 was most 
homologous to that of MsEPV. 

One highly conserved, ungapped motif (Figure 10) [I7V]-Y-I-G-S-X-X-[G/A]-[Y/T]-H- 
5 X-X-X-L can be somewhat expanded if comparisons are limited to only EPV sequences. Note 
that there are other completely conserved residues in the centermost region of the proteins, and 
many other conservative substitutions. 

Another interesting feature is revealed when one examines the C-terminus of the small 
subunits (Figure 10). One immediately notes that the comparable VV small subunit contains a 

10 C-terminal extension. The VV 36-43 amino acid C-terminal tail is non-essential for activity 
(Shi, X., Yao, P., Jose, T., and Gershon, P. [1996] "Methyltransferase-specific domains within 
VP-39, a bifunctional protein that participates in the modification of both mRNA ends" RNA 
2:88-101), and is probably retained because the C-terminal region of the W subunit overlaps the 
next open reading frame (J4R) which encodes a 22kDa subunit of the VV RNA polymerase 

1 5 (Goebel et ah [ 1 990] supra). 

One functional hypothesis to account for the presence of an additional small poly(A) 
subunit is suggested by the multiple activities of the poly(A) polymerase itself (Gershon, P. D., 
Shi, X. N., and Hodel, A. E. [1998] "Evidence that the RNA methylation and poly(A) 
polymerase stimulatory activities of vaccinia virus protein VP39 do not impinge upon one 

20 another" Virology 246:253-265). VV VP55 catalyzes the initial (-35 base) addition of 3' poly(A) 
to newly synthesized mRNA or 5* phosphorylated nucleotide primers (Gershon, P. D., Ahn, B. 
Y., Garfield, M., and Moss, B. [1991] "Poly(A) polymerase and a dissociable polyadenylation 
stimulatory factor encoded by vaccinia virus" Cell 66: 1 269- 1278). The small subunit, VP39 has 
three activities. The first is to serve as a processivity factor which in the presence of VP55 

25 extends the poly(A) length to several hundred A residues (Gershon, P. D. and Moss, B. [1993] 
"Stimulation of poly(A) tail elongation by the VP39 subunit of the vaccinia virus-encoded 
poly( A) polymerase" J. Bioh Chem. 268:2203-2210).- The second, distinct activity, mediated by 
VP39 alone, is an mRNA cap-specific 2 ! -0-methyltransferase (Schnierle et ah [1992] supra). 
The third activity is an associated transcription elongation factor (Latner, D. R., Xiang, Y., 

30 Lewis, J. L, Condit, J., and Condit, R. C. [2000] "The vaccinia virus bifunctional gene J3 
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(nucleoside-2 -0-)-methyltransferase and poly(A) polymerase stimulatory factor is implicated as 
a positive transcription elongation factor by two genetic approaches" Virology 269:345-355). It 
is possible that these various activities have been distributed amongst the two subunits. 
Alternatively, one of the subunits may have evolved to fulfill an entirely unrelated function. 
5 AMV050 (SEQ ED NO: 7) and AMV210 (SEQ ID NO: 8) encode AmEPV DNA 

polymerases. In view of our findings with the poly(A) polymerase, we would like to call 
attention to an interesting feature of EPV DNA polymerases first noted in African Swine Fever 
virus (Oliveros, M., Yanez, R. J., Salas, M. L., Salas, J., Vinuela, E., and Blanco, L. [1997] 
"Characterization of an African swine fever virus 20-kDa DNA polymerase involved in DNA 

10 repair" J. Biol. Chem. 272:30899-30910) and later in MsEPV (Afonso et al [1999] supra), 
which has also been found in AmEPV. The 1 1 05 residue AmEPV ORF AMV050 is similar in 
length, and homologous to typical poxvirus encoded DNA polymerases. A second smaller (612 
amino acids) AmEPV encoded ORF, AMV210, shares a 460 amino acid region of clear 
homology with AMV050, although both proteins possess completely unique regions; i.e., the N- 

15 terminus of AMV050 (residues 1-645) and C-terminus of AMV210 (residues 463-612). Both 
proteins have been found to contain DNA polymerase motifs (Table 1). 

AMV130 (SEQ ID NO: 9) encodes an AmEPV ABC transporter-like protein. AMV130 
represents the largest ORF in AmEPV. The 1384 residue protein shows homology to the ATP- 
binding cassette (ABC) proteins. These are a large gene family found from bacteria to man, and 

20 have a variety of functions (van Veen, H. W. and Konings, W. N. [ 1 998] "Structure and function 
of multidrug transporters" Adv. Exp. Med. Biol. 456:145-158). While most are ATP-driven 
membrane translocators, some act as ion channels, ion channel regulators, receptors, proteases, 
immune regulators and even sensing proteins (Bauer, B. E., Wolfger, H., andKuchler, K. [1999] 
"Inventory and function of yeast ABC proteins: about sex, stress, pleiotropic drug and heavy 

25 metal resistance" Biochim. Biophys. Acta 1461:217-236; Klein, L, Sarkadi, B., and Radi, A. 
[1999] "An inventory of the human ABC proteins" Biochim. Biophys. Acta 1461:237-262; 
Abele, R. and Tampe, R. [1999] "Function of the transport complex TAP in cellular immune 
recognition" Biochim. Biophys. Acta 1461:405-419). All ABC proteins share a common 
molecular architecture consisting of at least one 200-250 amino acid ABC cassette and several 

30 predicted a-helical membrane spanning segments (TMS or TMD). The minimum structural 

J:\UF\22 1 C 1 XCZ 1 \APPLICATION\APP.DOC/DNB/la 



41 UF-221C1XCZ1 

requirement is considered to be 2 ABC and 2 TMD regions, present in either 1 (full transporter) 
or 2 (half transporter) polypeptide chains. The AmEPV ABC protein consists of TMD-ABC- 
TMD-ABC domains, one of the structures of active ABC transporters. This arrangement of 
AMV130 domains is also found in the MDR/TAP, MRP, CFTR and ABC1 subfamilies and is 
5 associated with activities ranging from control of sex (yeast), drug resistance (humans, bacteria), 
ion channels (human CFTR gene) and engulfinent of dead cells (C. elegans) (Bauer et al [1999] 
supra; Klein et al [1999] supra; Abele and Tampe [1999] supra). Each AmEPV TMD contains 
6 or 7 transmembrane helices (Figure 11). No other virus is known to encode an ABC 
transporter. The potential ABC-transporter encoded by AmEPV may play a role in evading host 

10 immune defenses, e.g. facilitating removal of toxic elements from virally infected cells. 

AMV007 (SEQ ID NO: 10) encodes an AmEPV Kunitz-motif protease inhibitor (KPI). 
AmEPV ORF AMV007 is located near the left end of the AmEPV genome, and encodes a small 
protein of 79 amino acids. A Prosite search revealed the presence of a Kunitz family signature 
(Prosite PS00280), a motif associated with protease inhibitors (Figure 12). Indeed, the Kunitz- 

1 5 type pancreatic trypsin inhibitors represent one of the most common families of serine protease 
inhibitors. Kunitz-type inhibitors found within insects are typically less than 100 amino acids in 
length. All contain certain five invariant cysteine residues. AMV007 has all five cysteines and 
the alignment allows prediction of an arginine PL The inducible serine protease inhibitor (ISP- 
2) of Galleria mellonella (Frobius, A. C, Kanost, M. R., Gotz, P., and Vilcinskas, A. [2000] 

20 "Isolation and characterization of novel inducible serine protease inhibitors from larval 
hemolymph of the greater wax moth Galleria mellonella" Eur. J. Biochem. 267:2046-2053) and 
the hemolymph trypsin inhibitors (HLTIs A and B) of Manduca sexta (Ramesh, N., Sugumaran, 
M., and Mole, J. E. [1988] "Purification and characterization of two trypsin inhibitors from the 
hemolymph of Manduca sexta larvae' 9 J. Biol. Chem. 263:11523-11527) are both Kunitz-type 

25 inhibitors that contain PI residues of arginine, and inhibit trypsin-like proteases. Structurally, 
Kunitz-type inhibitors are comprised of short alpha/beta proteins with little secondary structure. 
Although widespread in nature, there are no reports of the presence of a Kunitz-type protease 
inhibitor (KPI) from this family in any viral genome. It is interesting to note that vertebrate 
poxviruses do encode protease inhibitors, but they are members of a different family (the serine 

30 protease inhibitor, serpin) family. The vertebrate poxvirus serpins have been shown to have an 
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immunoregulatory role in the infected vertebrate host (Turner, S., Kenshole, B., and Ruby, J. 
[ 1 999] "Viral modulation of the host response via crmA/SPI-2 expression" Immunology and Cell 
Biology 77:236-241; McFadden, G. [1995] In: Viroceptors, virokines and related immune 
modulators encoded byDNA viruses, R. G. Landes/Springer-Verlag, Austin, TX; McFadden, G., 
5 Graham, K., and Barry, M. [1996] "New Strategies of immune modulation by DNA viruses" 
Transplant. Proc. 28:2085-2088). We propose that the AmEPV KPI protein may fulfill a similar 
immunoregulatory role in the infected invertebrate host, but may target different pathways than 
do the serpins which control inflammation, apoptosis and the host immune response (Turner and 
Moyer [1998] supra; Turner, P. C., Musy, P. Y., and Moyer, R. W. [1995] Poxvirus Serpins, In 

10 "Viroceptors, Virokines and related immune modulators encoded by DNA viruses," G. 
McFadden, ed., pp. 67-88. R. G. Landes, Galveston, TX). 

One function the KPI protein may possess is suggested by the physiology of the insect 
host. The haemolymph of insects contains relatively high concentrations of a variety of protease 
inhibitors from several different gene families (Kanost, M. R. [1999] "Serine proteinase 

15 inhibitors in arthropod immunity" Developmental and Comparative Immunology 23:291-301; 
Jiang, H. B. and Kanost, M. R. [1997] "Characterization and functional analysis of 12 naturally 
occurring reactive site variants ofserpin-1 fromManducaSexta'V. Biol. Chem. 272:1082-1087). 
Protease inhibitors from the Kunitz family have been identified as haemolymph proteins from 
lepidopteran insect species (Sugumaran, M., Saul, S. J., and Ramesh, N. [1985] "Endogenous 

20 protease inhibitors prevent undesired activation of prophenolase in insect hemolymph" Biochem. 
Biophys. Res. Commun. 132:1124-1129; Sasaki, T. [1984] "Amino acid sequence of a novel 
Kunitz-type chymotrysin inhibitor from hemolymph of silkworm larvae" Bombyx moorei. FEBS 
Lett. 168:230), which function as inhibitors of trypsin or chymotrypsin. These host KPI proteins 
have been shown to be important in the avoidance of inopportune chymotrypsin-mediated 

25 activation of prophenyloxidase (Saul, S. J. and Sugumaran, M. [1986] "Protease inhibitor 
controls prophenoloxidase activation in Manduca Sexta" FEBS Lett. 208:113-116; Aso, Y., 
Yamashita, T., Meno, K., and Murakami, M. [1 994] "Inhibition of prophenoloxidase-activating 
enzyme from Bombyx mori by endogenous chymotrypsin inhibitors" Biochem. Mol Biol. Int. 
33:751 -758). This enzyme is an early component of the cascade required by the insect immune 

30 system to produce melanin, which is used to engulf and overcome invading foreign objects 
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(Gillespie, J. P., Kanost, M. R., and Trenczek, T. [1997] "Biological mediators of insect 
immunity" Annu. Rev. Entomol. 42:61 1-43, 61 1-643; Vilmos, P. and Kurucz, E. [1998] "Insect 
immunity: evolutionary roots of the mammalian innate immune system" Immunol Lett. 62:59- 
66). Production of such a protein by an infecting virus may therefore lessen the amount of 
5 prophenyloxidase induced by the insect immune system during infection. 

Polynucleotides of the subject invention include sequences identified in the attached 
sequence listing as well as the tables and figures and described by open reading frame (ORF) 
position within the genome. In addition, the subject invention includes polynucleotides which 
hybridize with other polynucleotides of the subject invention. 

10 Additional uses of polynucleotides . The polynucleotide sequences exemplified herein 

can be used in a variety of ways, having numerous applications in techniques known to those 
skilled in the art of molecular biology having the instant disclosure. These techniques include 
their use as hybridization probes, for chromosome and gene mapping, in PCR technologies, and 
in the production of sense or antisense nucleic acids. 

1 5 These polynucleotides can be used in assays for additional polynucleotides and additional 

homologous genes, and can be used in tracking the quantitative and temporal expression of these 
genes in cells and organisms. Polynucleotides of the subject invention may be used as insertion 
sites for foreign genes of interest. 

Antisense technology can also be used to interfere with expression of the disclosed 

20 polynucleotides. For example, the transformation of a cell or organism with the reverse 
complement of a gene encoded by a polynucleotide exemplified herein can result in strand co- 
suppression and silencing or inhibition of a target gene, e.g., one involved in the infection 
process. 

Polynucleotides disclosed herein are useful as target genes for the synthesis of antisense 
25 RNA or dsRNA useful for RNA-mediated gene interference. The ability to specifically inhibit 
gene function in a variety of organisms utilizing antisense RNA or ds RNA-mediated 
interference is well known in the fields of molecular biology (see for example CP. Hunter, 
Current Biology [1999] 9:R440-442; Hamilton et al., [1999] Science, 286:950-952; and S.W. 
Ding, Current Opinions in Biotechnology [2000] 11:1 52-1 56, hereby incorporated by reference 
30 in their entireties). dsRNA (RNAi) typically comprises a polynucleotide sequence identical or 
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homologous to a target gene (or fragment thereof) linked directly, or indirectly, to a 
polynucleotide sequence complementary to the sequence of the target gene (or fragment thereof). 
The dsRNA may comprise a polynucleotide linker sequence of sufficient length to allow for the 
two polynucleotide sequences to fold over and hybridize to each other; however, a linker 
5 sequence is not necessary. The linker sequence is designed to separate the antisense and sense 
strands of RNAi significantly enough to limit the effects of steric hindrances and allow for the 
formation of dsRNA molecules and should not hybridize with sequences within the hybridizing 
portions of the dsRNA molecule. The specificity of this gene silencing mechanism appears to be 
extremely high, blocking expression only of targeted genes, while leaving other genes unaffected. 

10 Accordingly, one method for controlling gene expression according to the subject invention 
provides materials and methods using double-stranded interfering RNA (dsRNAi), or RNA- 
mediated interference (RNAi). The terms dsRNAi and RNAi are used interchangeably herein 
unless otherwise noted. 

RNA containing a nucleotide sequence identical to a fragment of the target gene is 

1 5 preferred for inhibition; however, RNA sequences with insertions, deletions, and point mutations 
relative to the target sequence can also be used for inhibition. Sequence identity may optimized 
by sequence comparison and alignment algorithms known in the art (see Gribskov and Devereux, 
Sequence Analysis Primer, Stockton Press, 1991, and references cited therein) and calculating 
the percent difference between the nucleotide sequences by, for example, the Smith- Waterman 

20 algorithm as implemented in the BESTFIT software program using default parameters (e.g., 
University of Wisconsin Genetic Computing Group). Alternatively, the duplex region of the 
RNA may be defined functionally as a nucleotide sequence that is capable of hybridizing with a 
fragment of the target gene transcript. 

RNA maybe synthesized either in vivo or in vitro. Endogenous RNA polymerase of the 

25 cell may mediate transcription in vivo, or cloned RNA polymerase can be used for transcription 
in vivo or in vitro. For transcription from a transgene in vivo or an expression construct, a 
regulatory region (e.g. , promoter, enhancer, silencer, splice donor and acceptor, polyadenylation) 
may be used to transcribe the RNA strand (or strands); the promoters may be known inducible 
promoters such as baculovirus. Inhibition may be targeted by specific transcription in an organ, 

30 tissue, or cell type. The RNA strands may or may not be polyadenylated; the RNA strands may 
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or may not be capable of being translated into a polypeptide by a cell's translational apparatus. 
RNA may be chemically or enzymatically synthesized by manual or automated reactions. The 
RNA may be synthesized by a cellular RNA polymerase or a bacteriophage RNA polymerase 
(e.g. , T3, T7, SP6). The use and production of an expression construct are known in the art (see, 
5 for example, WO 97/32016; U.S. Pat. Nos. 5,593,874; 5,698,425; 5,712,135; 5,789,214; and 
. 5,804,693; and the references cited therein). If synthesized chemically or by in vitro enzymatic 
synthesis, the RNA may be purified prior to introduction into the cell. For example, RNA can be 
purified from a mixture by extraction with a solvent or resin, precipitation, electrophoresis, 
chromatography, or a combination thereof. Alternatively, the RNA may be used with no or a 

10 minimum of purification to avoid losses due to sample processing. The RNA may be dried for 
storage or dissolved in an aqueous solution. The solution may contain buffers or salts to promote 
annealing, and/or stabilization of the duplex strands. 

Preferably and most conveniently, dsRNAi can be targeted to an entire polynucleotide 
sequence set forth herein. Preferred RNAi molecules of the instant invention are highly 

1 5 homologous or identical to the polynucleotides of the sequence listing. The homology may be 
greater than 70%, preferably greater than 80%, more preferably greater than 90% and is most 
preferably greater than 95%. 

Fragments of genes can also be utilized for targeted suppression of gene expression. 
These fragments are typically in the approximate size range of about 20 nucleotides. Thus, 

20 targeted fragments are preferably at least about 1 5 nucleotides. In certain embodiments, the gene 
fragment targeted by the RNAi molecule is about 20-25 nucleotides in length. In a more 
preferred embodiment, the gene fragments are at least about 25 nucleotides in length. In an even 
more preferred embodiment, the gene fragments are at least 50 nucleotides in length. 

Thus, RNAi molecules of the subject invention are not limited to those that are targeted 

25 to the full-length polynucleotide or gene. Gene product can be inhibited with a RNAi molecule 
that is targeted to a portion or fragment of the exemplified polynucleotides; high homology (90- 
95%) or greater identity is also preferred, but hot necessarily essential, for such applications. 

In another aspect of the invention, the dsRNA molecules of the invention may be 
introduced into cells with single stranded (ss) RNA molecules which are sense or anti-sense 

30 RNA derived from the nucleotide sequences disclosed herein. Methods of introducing ssRNA 
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and dsRNA molecules into cells are well-known to the skilled artisan and includes transcription 
of plasmids, vectors, or genetic constructs encoding the ssRNA or dsRNA molecules according 
to this aspect of the invention; electroporation, biolistics, or other well-known methods of 
introducing nucleic acids into cells may also be used to introduce the ssRNA and dsRNA 
5 molecules of this invention into cells. 

Other aspects of the invention include use of the disclosed sequences or recombinant 
nucleic acids derived therefrom to produce purified peptides. The nucleotide sequences as 
disclosed herein may be used to produce an amino acid sequence using well-known methods of 
recombinant DNA technology. Goeddel (Gene Expression Technology, Methods and 
10 Enzymology [1990] Vol 185, Academic Press, San Diego, CA) is one among many publications 
which teach expression of an isolated, purified nucleotide sequence. The amino acid or peptide 
may be expressed in a variety of host cells, either prokaryotic or eukaryotic. Host cells may be 
from the same species from which the nucleotide sequence was derived or from a different 
species. 

1 5 Still further aspects of the invention use these purified peptides to produce antibodies or 

other molecules able to bind to the peptides. These antibodies or binding agents can then be used 
for the screening of cells in order to localize the cellular distribution of the peptides or proteins. 
The antibodies are also useful for the affinity purification of recombinantly produced peptides or 
proteins. 

20 The disclosed nucleotide sequences can be used individually, or in panels, in tests or 

assays to detect levels of peptide, polypeptide, or protein expression. The form of such 
qualitative or quantitative methods may include northern analysis, dot blot or other membrane 
based technologies, dip stick, pin or chip technologies, PCR, ELISAs or other multiple sample 
format technologies. 

25 The subject invention also provides polynucleotides identified as control elements or 

regulatory sequences, such as gene promoters, enhancers, introns and untranslated regions which 
interact with cellular components to carry out regulatory functions such as replication, 
transcription, and translation. The invention further comprises the use of the disclosed 
polynucleotide sequences, or fragments thereof, in assays to characterize and/or identify 

30 sequences having promoter or other regulatory activity. Also contemplated according to the 
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subject invention is the use of oligomers from these sequences in kits which can be used to 
identify promoters or other regulatory sequences. 

As used herein, the following definitions apply: 

An "oligonucleotide" or "oligomer" is a stretch of nucleotide residues which has a 
5 sufficient number of bases to be used in a polymerase chain reaction (PCR). These short 
sequences are based on (or designed from) genomic or cDNA sequences and are used to amplify, 
confirm, or reveal the presence of an identical, similar or complementary DNA or RNA in a 
particular cell or tissue. Oligonucleotides or oligomers comprise portions of a DNA sequence 
having at least about 1 0 nucleotides and as many as about 50 nucleotides, preferably about 1 5 to 
10 30 nucleotides. They can be chemically synthesized and may be used as probes. 

"Probes" are nucleic acid sequences of variable length, preferably between at least about 
10 and as many as about 6,000 nucleotides, depending on use. They are used in the detection of 
identical, similar, or complementary nucleic acid sequences. Longer length probes are usually 
obtained from a natural or recombinant source, are highly specific and much slower to hybridize 
15 than oligomers. They may be single- or double-stranded and designed to have specificity in 
PCR, hybridization membrane-based, or ELISA-like technologies. 

"Reporter" molecules are chemical moieties used for labeling a nucleic or amino acid 
sequence. They include, but are not limited to, radionuclides, enzymes, fluorescent, 
chemi-luminescent, or chromogenic agents. Reporter molecules associate with, establish the 
20 presence of, and may allow quantification of a particular nucleic or amino acid sequence. 

A "portion" or "fragment" of a polynucleotide or nucleic acid comprises all or any part of 
the nucleotide sequence having fewer nucleotides than about 6 kb, preferably fewer than about 1 
kb which can be used as a probe. Such probes may be labeled with reporter molecules using nick 
translation, Klenow fill-in reaction, PCR or other methods well known in the art. After 
25 pretesting to optimize reaction conditions and to eliminate false positives, nucleic acid probes 
may be used in Southern, northern or in situ hybridizations to determine whether target DNA or 
RNA is present in a biological sample, cell type, tissue, organ or organism. 

"Recombinant nucleotide variants" are alternate polynucleotides which encode a 
particular protein. They may be synthesized, for example, by making use of the "redundancy" in 
30 the genetic code. Various codon substitutions, such as the silent changes which produce specific 

J:\UF\22 1 C 1 XCZ 1 \APPLICATION\APP.DOC/DNB/la 



48 UF-221C1XCZ1 

restriction sites or codon usage-specific mutations, may be introduced to optimize cloning into a 
plasmid or viral vector or expression in a particular prokaryotic or eukaryotic host system, 
respectively. 

"Linkers" are synthesized palindromic nucleotide sequences which create internal 
5 restriction endonuclease sites for ease of cloning the genetic material of choice into various 
. vectors. "Polylinkers" are engineered to include multiple restriction enzyme sites and provide for 
the use of both those enzymes which leave 5* and 3' overhangs such as BamHI, EcoRI, PstI, Kpnl 
and Hind HI or which provide a blunt end such as EcoRV, SnaBI and Stul. 

"Control elements" or "regulatory sequences" are regions of the gene or DNA such as 
1 0 enhancers, promoters, introns and 3' untranslated regions which interact with cellular proteins to 
carry out replication, transcription, and translation. Typically, these regions are nontranslated. 
They may occur as boundary sequences or even split the gene. They function at the molecular 
level and along with regulatory genes are very important in development, growth, differentiation 
and aging processes. 

15 "Chimeric" molecules are polynucleotides or polypeptides which are created by 

combining one or more nucleotide peptide sequences (or their parts). In the case of nucleotide 
sequences, such combined sequences maybe introduced into an appropriate vector and expressed 
to give rise to a chimeric polypeptide which may be expected to be different from the native 
molecule in one or more of the following characteristics: cellular location, distribution, 

20 ligand-binding affinities, interchain affinities, degradation/turnover rate, signaling, etc. 

"Active" is that state which is capable of being useful or of carrying out some role. It 
specifically refers to those forms, fragments, or domains of an amino acid sequence which 
display the biologic and/or immunogenic activity characteristic of the naturally occurring 
peptide, polypeptide, or protein. 

25 "Naturally occurring" refers to a polypeptide produced by cells which have not been 

genetically engineered or which have been genetically engineered to produce the same sequence 
as that naturally produced. 

"Derivative" refers to those polypeptides which have been chemically modified by such 
techniques as ubiquitination, labeling, pegylation (derivatization with polyethylene glycol), and 
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chemical insertion or substitution of amino acids such as ornithine which do not normally occur 
in proteins. 

"Recombinant polypeptide variant" refers to any polypeptide which differs from naturally 
occurring peptide, polypeptide, or protein by amino acid insertions, deletions and/or 
5 substitutions. 

Amino acid "substitutions" are defined as one for one amino acid replacements. They are 
conservative in nature when the substituted amino acid has similar structural and/or chemical 
properties. Examples of conservative replacements are substitution of a leucine with an 
isoleucine or valine, an aspartate with a glutamate, or a threonine with a serine. 

1 0 Amino acid "insertions" or "deletions" are changes to or within an amino acid sequence. 

They typically fall in the range of about 1 to 5 amino acids. The variation allowed in a particular 
amino acid sequence maybe experimentally determined by producing the peptide synthetically or 
by systematically making insertions, deletions, or substitutions of nucleotides in the sequence 
using recombinant DNA techniques. 

15 A "signal or leader sequence" is a short amino acid sequence which can be used, when 

desired, to direct the polypeptide through a membrane of a cell. Such a sequence may be 
naturally present on the polypeptides of the present invention or provided from heterologous 
sources by recombinant DNA techniques. Such sequences include nuclear localization 
sequences (NLS) known in the art. 

20 An "oligopeptide" is a short stretch of amino acid residues and may be expressed from an 

oligonucleotide. Such sequences comprise a stretch of amino acid residues of at least about 5 
amino acids and often about 1 7 or more amino acids, typically at least about 9 to 1 3 amino acids, 
and of sufficient length to display biologic and/or immunogenic activity. 

An "inhibitor" is a substance which retards or prevents a chemical or physiological 

25 reaction or response. Common inhibitors include but are not limited to antisense molecules, 
antibodies, antagonists and their derivatives. 

A "standard" is a quantitative or qualitative measurement for comparison. Preferably, it 
is based on a statistically appropriate number of samples and is created to use as a basis of 
comparison when performing diagnostic assays, running clinical trials, or following patient 

30 treatment profiles. The samples of a particular standard may be normal or similarly abnormal. 
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Since the list of technical and scientific terms cannot be all encompassing, any undefined 
terms shall be construed to have the same meaning as is commonly understood by one of skill in 
the art to which this invention belongs. Furthermore, the singular forms "a", "an" and "the" 
include plural referents unless the context clearly dictates otherwise. 
5 The invention is not to be limited only to the particular sequences, variants, formulations 

or methods described. The sequences, variants, formulations and methodologies may vary, and 
the terminology used herein is for the purpose of describing particular embodiments. The 
terminology and definitions are not intended to be limiting. 

Polynucleotide probes . DNA possesses a fundamental property called base 

1 0 complementarity. In nature, DNA ordinarily exists in the form of pairs of anti-parallel strands, 
the bases on each strand projecting from that strand toward the opposite strand. The base 
adenine (A) on one strand will always be opposed to the base thymine (T) on the other strand, 
and the base guanine (G) will be opposed to the base cytosine (C). The bases are held in 
apposition by their ability to hydrogen bond in this specific way. Though each individual bond is 

15 relatively weak, the net effect of many adjacent hydrogen bonded bases, together with base 
stacking effects, is a stable joining of the two complementary strands. These bonds can be 
broken by treatments such as high pH or high temperature, and these conditions result in the 
dissociation, or "denaturation," of the two strands. If the DNA is then placed in conditions which 
make hydrogen bonding of the bases thermodynamically favorable, the DNA strands will anneal, 

20 or "hybridize," and reform the original double- stranded DNA. If carried out under appropriate 
conditions, this hybridization can be highly specific. That is, only strands with a high degree of 
base complementarity will be able to form stable double-stranded structures. The relationship of 
the specificity of hybridization to reaction conditions is well known. Thus, hybridization maybe 
used to test whether two pieces of DNA are complementary in their base sequences. It is this 

25 hybridization mechanism which facilitates the use of probes of the subject invention to readily 
detect and characterize DNA sequences of interest. 

The polynucleotides of the subject invention can themselves be used as probes. 
Additional polynucleotide sequences can be added to the ends of (or internally in) the 
exemplified polynucleotide sequences so that polynucleotides that are longer than the 

30 exemplified polynucleotides can also be used as probes. Thus, isolated polynucleotides 
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comprising one or more of the exemplified sequences are within the scope of the subject 
invention. Polynucleotides that have less nucleotides than the exemplified polynucleotides can 
also be used and are contemplated within the scope of the present invention. For example, for 
some purposes, it might be useful to use a conserved sequence from an exemplified 
5 polynucleotide wherein the conserved sequence comprises a portion of an exemplified sequence. 
Thus, polynucleotides of the subject invention can be used to find additional, homologous 
(wholly or partially) genes. Hybridization probes of the subject invention may be derived from 
the open reading frames specifically exemplified in the sequence listing, figures, and tables as 
well as from surrounding or included genomic sequences comprising untranslated regions such 

10 as promoters, enhancers and introns. 

Probes of the subject invention may be composed of DNA, RNA, or PNA (peptide 
nucleic acid). The probe will normally have at least about 10 bases, more usually at least about 
17 bases, and may have up to about 100 bases or more. Longer probes can readily be utilized, 
and such probes can be, for example, several kilobases in length. The probe sequence is 

1 5 designed to be at least substantially complementary to a portion of a gene encoding a protein of 
interest. The probe need not have perfect complementarity to the sequence to which it 
hybridizes. The probes may be labeled utilizing techniques that are well known to those skilled 
in this art. 

One approach for the use of the subject invention as probes entails first identifying DNA 
20 segments that are homologous with the disclosed nucleotide sequences using, for example, 
Southern blot analysis of a gene bank. Thus, it is possible, without the aid of biological analysis, 
to know in advance the probable activity of many new polynucleotides, and of the individual 
gene products expressed by a given polynucleotide. Such an analysis provides a rapid method 
for identifying commercially valuable compositions. 
25 One hybridization procedure useful according to the subject invention typically includes 

the initial steps of isolating the DNA sample of interest and purifying it chemically. Either lysed 
cells or total fractionated nucleic acid isolated from cells can be used. Cells can be treated using 
known techniques to liberate their DNA (and/or RNA). The DNA sample can be cut into pieces 
with an appropriate restriction enzyme. The pieces can be separated by size through 
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electrophoresis in a gel, usually agarose or acrylamide. The pieces of interest can be transferred 
to an immobilizing membrane. 

The particular hybridization technique is not essential to the subject invention. As 
improvements are made in hybridization techniques, they can be readily applied. 
5 The probe and sample can then be combined in a hybridization buffer solution and held at 

an appropriate temperature until annealing occurs. Thereafter, the membrane is washed free of 
extraneous materials, leaving the sample and bound probe molecules typically detected and 
quantified by autoradiography and/or liquid scintillation counting. As is well known in the art, if 
the probe molecule and nucleic acid sample hybridize by forming a strong non-covalent bond 

10 between the two molecules, it can be reasonably assumed that the probe and sample are 
essentially identical or very similar. The probe's detectable label provides a means for 
determining in a known manner whether hybridization has occurred. 

In the use of the nucleotide segments as probes, the particular probe is labeled with any 
suitable label known to those skilled in the art, including radioactive and non-radioactive labels. 

15 Typical radioactive labels include 32 P, 35 S, or the like. Non-radioactive labels include, for 
example, ligands such as biotin or thyroxine, as well as enzymes such as hydrolases or 
peroxidases, or the various chemiluminescers such as luciferin, or fluorescent compounds like 
fluorescein and its derivatives. In addition, the probes can be made inherently fluorescent as 
described in International Application No. WO 93/16094. 

20 Various degrees of stringency of hybridization can be employed. The more stringent the 

conditions, the greater the complementarity that is required for duplex formation. Stringency can 
be controlled by temperature, probe concentration, probe length, ionic strength, time, and the 
like. Preferably, hybridization is conducted under moderate to high stringency conditions by 
techniques well known in the art, as described, for example, in Keller, G.H., M.M. Manak (1 987) 

25 DNA Probes, Stockton Press, New York, NY., pp. 169-170. 

As used herein "moderate to high stringency" conditions for hybridization refers to 
conditions that achieve the same, or about the same, degree of specificity of hybridization as the 
conditions "as described herein." Examples of moderate to high stringency conditions are 
provided herein. Specifically, hybridization of immobilized DNA on Southern blots with 32 P- 

30 labeled gene-specific probes was performed using standard methods (Maniatis et al. ). In general, 
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hybridization and subsequent washes were carried out under moderate to high stringency 
conditions that allowed for detection of target sequences with homology to sequences 
exemplified herein. For double-stranded DNA gene probes, hybridization was carried out 
overnight at 20-25 0 C below the melting temperature (Tm) of the DNA hybrid in 6X SSPE, 5X 
5 Denhardt's solution, 0.1% SDS, 0.1 mg/ml denatured DNA. The melting temperature is 
described by the following formula from Beltz et al (1983): 

Tm = 8L5°C + 16.6 Log[Na+]+0.41(% G+C)-0.61(% formamide)-600/length of duplex 
in base pairs. 

Washes are typically carried out as follows: 
10 (1) Twice at room temperature for 15 minutes in IX SSPE, 0.1% SDS (low 

stringency wash). 

(2) Once at Tm-20°C for 15 minutes in 0.2X SSPE, 0.1% SDS (moderate stringency 
wash). 

For oligonucleotide probes, hybridization was carried out overnight at 10-20°C below the 
15 melting temperature (Tm) of the hybrid in 6X SSPE, 5X Denhardt's solution, 0.1% SDS, 0.1 
mg/ml denatured DNA. Tm for oligonucleotide probes was determined by the following formula 
from Suggs et al (1981): 

Tm (°C)=2(number T/A base pairs) +4(number G/C base pairs) 
20 Washes were typically carried out as follows: 

( 1 ) Twice at room temperature for 1 5 minutes 1 X SSPE, 0. 1 % SDS (low stringency 
wash). 

(2) Once at the hybridization temperature for 15 minutes in IX SSPE, 0.1% SDS 
(moderate stringency wash). 

25 In general, salt and/or temperature can be altered to change stringency. With a labeled . 

DNA fragment of greater than about 70 or so bases in length, the following conditions can be 
used: 

Low: 1 or 2X SSPE, room temperature 

Low: 1 or2XSSPE,42°C 

30 Moderate: 0.2X or IX SSPE, 65°C 
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High: 0.1XSSPE,65°C. 
Duplex formation and stability depend on substantial complementarity between the two 
strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
Therefore, polynucleotide sequences of the subject invention include mutations (both single and 
5 multiple), deletions, and insertions in the described sequences, and combinations thereof, 
wherein said mutations, insertions, and deletions permit formation of stable hybrids with a target 
polynucleotide of interest. Mutations, insertions, and deletions can be produced in a given 
polynucleotide sequence using standard methods known in the art. Other methods may become 
known in the future. 

10 The mutational, insertional, and deletional variants of the polypeptide sequences of the 

invention can be used in the same manner as the exemplified polynucleotide sequences so long 
as the variants have substantial sequence similarity with the original sequence. As used herein, 
substantial sequence similarity refers to the extent of nucleotide similarity that is sufficient to 
enable the variant polynucleotide to. function in the same capacity as the original sequence. 

15 Preferably, this similarity is greater than 50%; more preferably, this similarity is greater than 
75%; and most preferably, this similarity is greater than 90%. The degree of similarity needed 
for the variant to function in its intended capacity will depend upon the intended use of the 
sequence. It is well within the skill of a person trained in this art to make mutational, insertional, 
and deletional mutations that are designed to improve the function of the sequence or otherwise 

20 provide a methodological advantage. 

In one embodiment, the genes of the subject invention have at least one of the following 
characteristics: 

said gene is encoded by a nucleotide sequence which hybridizes under stringent 
conditions with a nucleotide sequence selected from the group consisting of: DNA 
25 which encodes SEQ ID NO: 1 , DNA which encodes SEQ ID NO: 2, DNA which encodes 

SEQ ID NO: 3, DNA which encodes SEQ ID NO: 4, DNA which encodes SEQ ID NO: 5 
or SEQ ID NO: 6, DNA which encodes SEQ ID NO: 7 or SEQ ID NO: 8, DNA which 
encodes SEQ ID NO: 9, DNA which encodes SEQ ID NO: 10, and DNA which encodes 
SEQ ID NO: 11. 
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The subject invention also includes polynucleotides that hybridize with other 
polynucleotides of the subject invention. 

PCR technology . Polymerase Chain Reaction (PCR) is a repetitive, enzymatic, primed 
synthesis of a nucleic acid sequence. This procedure is well known and commonly used by those 
5 skilled in this art (see U.S. Patent Nos. 4,683,195, 4,683,202, and 4,800, 159; Saiki etaL, 1985). 
PCR is based on the enzymatic amplification of a DNA fragment of interest that is flanked by 
two oligonucleotide primers that hybridize to opposite strands of the target sequence. The 
primers are oriented with the 3' ends pointing towards each other. Repeated cycles of heat 
denaturation of the template, annealing of the primers to their complementary sequences, and 

10 extension of the annealed primers with a DNA polymerase result in the amplification of the 
segment defined by the 5 ' ends of the PCR primers. Since the extension product of each primer 
can serve as a template for the other primer, each cycle essentially doubles the amount of DNA 
fragment produced in the previous cycle. This results in the exponential accumulation of the 
specific target fragment, up to several million-fold in a few hours. By using a thermostable DNA 

1 5 polymerase such as Tag polymerase, which is isolated from the thermophilic bacterium Thermus 
aquaticus, the amplification process can be completely automated. Other enzymes that can be 
used are known to those skilled in the art. 

The polynucleotide sequences of the subject invention (and portions thereof such as 
conserved regions and portions that serve to distinguish these sequences from previously-known 

20 sequences) can be used as, and/or used in the design of, primers for PCR amplification. In 
performing PCR amplification, a certain degree of mismatch can be tolerated between primer and 
template. Therefore, mutations, deletions, and insertions (especially additions of nucleotides to 
the 5' end) of the exemplified ppolynucleotides can be used in this manner. Mutations, 
insertions and deletions can be produced in a given primer by methods known to an ordinarily 

25 skilled artisan. 

Full length genes may be cloned utilizing partial nucleotide sequence and various 
methods known in the art. Gobinda et al. (1993; PCR Methods Applic 2:318-22) disclose 
"restriction-site PCR" as a direct method which uses universal primers to retrieve unknown 
sequence adjacent to a known locus. First, genomic DNA is amplified in the presence of primer 
30 to linker and a primer specific to the known region. The amplified sequences are subjected to a 
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second round of PCR with the same linker primer and another specific primer internal to the first 
one. Products of each round of PCR are transcribed with an appropriate RNA polymerase and 
sequenced using reverse transcriptase. 

Inverse PCR can be used to acquire unknown sequences starting with primers based on a 
5 known region (Triglia T. et al (1988) Nucleic Acids Res 16:8186). The method uses several 
restriction enzymes to generate a suitable fragment in the known region of a gene. The fragment 
is then circularized by intramolecular ligation and used as a PCR template. Divergent primers 
are designed from the known region. The multiple rounds of restriction enzyme digestions and 
ligations that are necessary prior to PCR make the procedure slow and expensive (Gobinda et al 
10 [1993] supra). 

Capture PCR (Lagerstrom M. et al (1991) PCR Methods Applic 1 : 1 1 1-19) is a method 
for PCR amplification of DNA fragments adjacent to a known sequence in eucaryotic and YAC 
DNA. As noted by Gobinda et al (1 993, supra), capture PCR also requires multiple restriction 
enzyme digestions and ligations to place an engineered double-stranded sequence into an 
15 unknown portion of the DNA molecule before PCR. Although the restriction and ligation 
reactions are carried out simultaneously, the requirements for extension, immobilization and two 
rounds of PCR and purification prior to sequencing render the method cumbersome and time 
consuming. 

Parker J. D. etal {Nucleic Acids Res [1991] 19:3055-60), teach walking PCR, a method 
20 for targeted gene walking which permits retrieval of unknown sequences. PromoterFinder™ is a 
kit available from Clontech Laboratories, Inc. (Palo Alto, CA) which uses PCR and primers 
derived from p53 to walk in genomic DNA. Nested primers and special PromoterFinder™ 
libraries are used to detect upstream sequences such as promoters and regulatory elements. This 
process avoids the need to screen libraries and is useful in finding intron/exon junctions. 
25 A new PCR method replaces methods which use labeled probes to screen plasmid 

libraries and allow one researcher to process only about 3-5 genes in 14-40 days. In the first step, 
which can be performed in about two days, any two of a plurality of primers are designed and 
synthesized based on a known partial sequence. In step 2, which takes about six to eight hours, 
the sequence is extended by PCR amplification of a selected library. Steps 3 and 4, which take 
30 about one day, are purification of the amplified cDNA and its ligation into an appropriate vector. 
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Step 5, which takes about one day, involves transforming and growing up host bacteria. In step 
6, which takes approximately five hours, PCR is used to screen bacterial clones for extended 
sequence. The final steps, which take about one day, involve the preparation and sequencing of 
selected clones. 

5 If the full length cDNA has not been obtained, the entire procedure is repeated using 

either the original library or some other preferred library. The preferred library may be one that 
has been size-selected to include only larger cDNAs or may consist of single or combined 
commercially available libraries, e.g., from Clontech Laboratories, Inc. (Palo Alto, CA). The 
cDNA library may have been prepared with oligo (dT) or random priming. Random primed 
1 0 libraries are preferred in that they will contain more sequences which contain 5' ends of genes. A 
randomly primed library may be particularly useful if an oligo (dT) library does not yield a 
• complete gene. It must be noted that the larger and more complex the protein, the less likely it is 
that the complete gene will be found in a single plasmid. 

CLONTECH PCR-SELECT cDNA Subtraction (Clontech Laboratories, Inc., Palo Alto, 
15 CA) is yet another means by which differentially expressed genes may be isolated. The 
procedure allows for the isolation of transcripts present in one mRNA population which is 
absent, or found in reduced numbers, in a second population of mRNA. Rare transcripts maybe 
enriched 1000-fold. 

A new method for analyzing either the size or the nucleotide sequence of PCR products is 
20 capillary electrophoresis. Systems for rapid sequencing are available from Perkin Elmer (Foster 
City CA), Beckman Instruments (Fullerton, CA), and other companies. Capillary sequencing 
employs flowable polymers for electrophoretic separation, four different fluorescent dyes (one 
for each nucleotide) which are laser activated, and detection of the emitted wavelengths by a 
charge coupled devise camera. Output/light intensity is converted to electrical signal using 
25 appropriate software (eg. GENOTYPER and SEQUENCE NAVIGATORS from Perkin Elmer) 
and the entire process from loading of samples to computer analysis and electronic data display is 
computer controlled. Capillary electrophoresis provides greater resolution and is many times 
faster than standard gel based procedures. It is particularly suited to the sequencing of small 
pieces of DNA which might be present in limited amounts in a particular sample. The 
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reproducible sequencing of up to 350 bp of Ml 3 phage DNA in 30 min has been reported 
(Ruiz-Martinez M. C. et al. [1993] Anal Chem 65:2851-8). 

Polynucleotides and proteins . Polynucleotides of the subject invention can be defined 
according to several parameters. One characteristic is the biological activity of the protein 
5 products as identified herein. The proteins and genes of the subject invention can be further 
defined by their amino acid and nucleotide sequences. The sequences of the molecules can be 
defined in terms of homology to certain exemplified sequences as well as in terms of the ability 
to hybridize with, or be amplified by, certain exemplified probes and primers. Additional 
primers and probes can readily be constructed by those skilled in the art such that alternate 

1 0 polynucleotide sequences encoding the same amino acid sequences can be used to identify and/or 
characterize additional genes. The proteins of the subject invention can also be identified based 
on their immunoreactivity with certain antibodies. 

The polynucleotides and proteins or polypeptides of the subject invention include 
portions, fragments, variants, and mutants of the full-length sequences as well as fusions and 

15 chimerics, so long as the encoded protein retains the characteristic biological activity of the 
proteins identified herein. As used herein, the terms "variants" or "variations" of genes refer to 
nucleotide sequences that encode the same proteins or which encode equivalent proteins having 
equivalent biological activity. As used herein, the term "equivalent proteins" refers to proteins 
having the same or essentially the same biological activity as the exemplified proteins. 

20 Variations of genes may be readily constructed using standard techniques such as site- 

directed mutagenesis and other methods of making point mutations and by DNA shuffling, for 
example. In addition, gene and protein fragments can be made using commercially available 
exonucleases, endonucleases, and proteases according to standard procedures. For example, 
enzymes such as BaB 1 can be used to systematically cut off nucleotides from the ends of genes. 

25 Also, genes that encode fragments may be obtained using a variety of restriction enzymes. 
Proteases may be used to directly obtain active fragments of these proteins. Of course, molecular 
techniques for cloning polynucleotides and producing gene constructs of interest are also well 
known in the art. In vitro evaluation techniques, such as MAXYGEN's "Molecular Breeding" 
can also be applied to practice the subject invention. 
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Because of the redundancy of the genetic code, a variety of different DNA sequences can 
encode the amino acid sequences encoded by the polynucleotide sequences disclosed herein. It is 
well within the skill of a person trained in the art to create these alternative DNA sequences 
encoding proteins having the same, or essentially the same, amino acid sequence. These variant 
5 DNA sequences are within the scope of the subject invention. As used herein, reference to 
"essentially the same" sequence refers to sequences that have amino acid substitutions, deletions, 
additions, or insertions that do not materially affect biological activity. Fragments retaining the 
characteristic biological activity are also included in this definition. 

A further method for identifying genes and polynucleotides (and the proteins encoded 
1 0 thereby) of the subject invention is through the use of oligonucleotide probes. Probes provide a 
rapid method for identifying genes of the subject invention. The nucleotide segments that are 
used as probes according to the invention can be synthesized using a DNA synthesizer and 
standard procedures. 

The subject invention comprises variant or equivalent proteins (and nucleotide sequences 

1 5 coding for equivalent proteins) having the same or similar biological activity of proteins encoded 
by the exemplified polynucleotides. Equivalent proteins will have amino acid similarity with an 
exemplified protein (or peptide). The amino acid identity will typically be greater than 60%. 
Preferably, the amino acid identity will be greater than 75%. More preferably, the amino acid 
identity will be greater than 80%, and even more preferably greater than 90%. Most preferably, 

20 amino acid identity will be greater than 95%. (Likewise, the polynucleotides that encode the 
subject polypeptides will also have corresponding identities in these preferred ranges.) These 
identities are as determined using standard alignment techniques for determining amino acid 
identity. The amino acid identity/similarity/ homology will be highest in critical regions of the 
protein including those regions that account for biological activity or that are involved in the 

25 determination of three-dimensional configuration that is ultimately responsible for the biological 
activity. In this regard, certain amino acid substitutions are acceptable and can be expected if 
these substitutions are in regions which are not critical to activity or are conservative amino acid 
substitutions which do not affect the three-dimensional configuration of the molecule. For 
example, amino acids maybe placed in the following classes: non-polar, uncharged polar, basic, 

30 and acidic. Conservative substitutions whereby an amino acid of one class is replaced with 
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another amino acid of the same type fall within the scope of the subject invention so long as the 
substitution does not materially alter the biological activity of the compound. 



Table 4 provides a listing of 


examples of amino acids belonging to each class. 


Table 4. 


Class of Amino Acid 


Examples of Amino Acids 


Nonpolar 


Ala, Val, Leu, He, Pro, Met, Phe, Trp 


Uncharged Polar 


Gly, Ser, Thr, Cys, Tyr, Asn, Gin 


Acidic 


Asp, Glu 


Basic 


Lys, Arg, His 



5 

In some instances, non-conservative substitutions can also be made. The critical factor is 
that these substitutions must not significantly detract from the biological activity of the 
polypeptide. 

An "isolated" or "substantially pure" nucleic acid molecule or polynucleotide is a 
10 polynucleotide that is substantially separated from other polynucleotide sequences which 
naturally accompany a nucleic acid molecule. The term embraces a polynucleotide sequence 
which was removed from its naturally occurring environment by the hand of man. This includes 
recombinant or cloned DNA isolates, chemically synthesized analogues and analogues 
biologically synthesized by heterologous systems. An "isolated" or "purified" protein or 
15 polypeptide, likewise, is a one removed from its naturally occurring environment. 

Materials and Methods 

Cells and virus. 

AmEPV (Hall, R. L. and R.W. Moyer [1991] supra) was replicated in EPLB-LD-652 cells 
20 (Goodwin, R. H., J. R. Adams and M. Shapiro [1990] "Replication of the entomopoxvirus from 
Amsacta moorei in serum-free cultures of a gypsy moth cell line" J, Invertebr. Pathol 56:190- 
205) which were maintained at 28°C in a 1:1 mixed medium (TE medium) of TC-100 media 
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(Gibco, Gaithersburg, MD) and EX-CELL 401 media (JRH Biosciences, Lenexa, KS), 
supplemented with 10% fetal bovine serum. A TK negative cell line designated CI 1.3 was 
selected by a process of adaption of TK(+) LD652 cell to increasing levels, 10 ^g/rnl every 5 
weeks, of 5-bromo-2'-deoxyuridine (BudR) over one year up to 100 /ug/ml BudR and maintained 
5 in TE medium containing BudR (100 /ig/ml). 293 cells were grown in DMEM medium 
supplemented with 5% fatal bovine serum. 

Plasmid construction and preparation of AmEPV recombinant . 

pTR-UF5 (see Figure 1, provided by the Vector Core, Gene Therapy Center, University 

10 of Florida) contains GFP and NeoR genes under control CMV promoter and herpes virus TK 
promoter respectively and flanked by ITR sequences of AAV. The Pst I fragment which contains 
GFP and NeoR markers was inserted into Pst I site of pTKDU (Li, Y., R.L. Hall, S.L. Yuan, 
R.W. Moyer [1998] "High level expression of Amsacta moorei entomopoxvirus Spheroidin 
depends on sequences within the gene" Gen. Virol 79:613-622) to produce pTKUF5. AmEPV 

1 5 recombinant with an insert in the TK gene was obtained as described previously (Li et ah [ 1 998] 
supra). 

Viral genomic DNA preparation 

Growth and maintenance protocols for IPLB-LD-652 cells and AmEPV are described in 

20 detail in Bawden et ai, 2000. DNA was obtained from amplification of a single wtAmEPV 
plaque (Bawden, A. L., Li, Y., Maggard, K., and Moyer, R. W. [2000] Entomopoxvirus Vectors. 
In "Viral Vectors: Basic Science and Gene Therapy," A. Cid-Arregui, Ed., Eaton Publishing, 
Natick, MA (In Press)). Thirty 150 cm 2 dishes containing approximately 2.4 x 10 7 LD652 cells 
(Goodwin, R. H., Adams, J. R., and Shapiro, M. [1990] "Replication of the Entomopoxvirus 

25 from Amsacta moorei in Serum-Free Cultures of a Gypsy Moth Cell Line" /. Invertebr. Path 
56:190-205) were infected at an MOI of 0.01 with wild type AmEPV and incubated at 28°C. 
Infections (cells and medium) were harvested 6 days post-infection and centrifuged at 500 x g for 
15 minutes to remove cells. The supernatant was centrifuged at 40,000 x g for 30 minutes to 
pellet virus. The pellet was resuspended in dH 2 0 (100 /xL for each initial 30 mL of supernatant). 

30 DNase free RNase was added to a final concentration of 50 /xg/mL and incubated at 37 °C for 30 
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min. The sample and lysis buffer (lOOmM Tris pH 8.0, 10 mM EDTA, 54% sucrose, 2% SDS, 
lOmM jS-mercaptoethanol) were brought to 50°C, and lysis buffer was added to the sample at a 
1:1 ratio. Proteinase K was added to a final concentration of 0.6 mg/mL. The viral lysate was 
incubated overnight at 50°C. The lysate was extracted three times with 50:49:1 
5 phenol:chloroform:isoamyl alcohol, once with chloroform, and the DNA precipitated in 0.4 M 
LiCl 2 , 95% ethanol. 

Tsp509l partial digest library preparation 

Ten micrograms of AmEPV DNA were digested with 5 units of 7sp509L Two aliquots 

10 were removed at 3 and 6 minute time points and digestion stopped with 50 mM EDTA (final 
concentration). This method was repeated in triplicate for a total of 30 jxg of digested DNA. 
Fragments of 2-3 kb and 4-5 kb were gel-purified separately with the Gene-Clean II kit (Bio 101, 
Vista, CA) and ligated into the EcoRI site of the PUC19 plasmid vector (Amersham Pharmacia 
Biotech UK Ltd., Chalfont, Buckinghamshire, England). The ligation mixture was transformed 

1 5 into DH5-a competent cells and plated onto LB agar plates containing 50 /zg/mL ampicillin and 
800 /xg/plate each IPTG and X-gal (Horton, P. and Nakai, K. [1997] "Better prediction of protein 
cellular localization sites with the k nearest neighbors classifier" Ismb. 5:147-152). White 
colonies were isolated and grown overnight in 1 mL TB medium (Horton & Nakai [ 1 997] supra) 
plus 50 fig/mL ampicillin. 

20 

Sequence determination 

Plasmid DNA was prepared using the QIAgen BioRobot 9600 and the QIAprep 96 Turbo 
miniprep kit. Sequencing was performed with 200-500ng of plasmid DNA as template using a 
0.25X concentration of ABI Prism BigDye Terminator Cycle Sequencing Ready Reaction kit 
25 (#4303 153; Perkin-Elmer Applied Biosystems [ABI], Foster City, CA). Cycle sequencing was 
performed using a PTC-200 DNA Engine (MJ Research, Watertown, MA) (25 cycles: 1 degree 
per second to 96 degrees; 96 for 10 seconds; 1 degree per second to 60 degrees; 60 for 4 
minutes). Dye terminator removal was on Multiscreen-HV plates (Millipore) with Sephadex G- 
50 superfine (Sigma, St. Louis, MO) in water. The reactions were electrophoresed on an ABI 
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377 sequencer, and the chromatograms were edited with Analysis version 1.2.1 (ABI) and 
assembled as follows. 



Sequence Assembly and Analysis 
5 Chromatograms were assembled into "contigs" using the Phred/Phrap/Consed software 

package (Horton, P. and Nakai, K. [1996] "A probabilistic classification system for predicting 
the cellular localization sites of proteins" Ismb. 4:109-115; Altschul, S. F., Madden, T. L., 
Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. J. [1997] "Gapped BLAST and 
PSI-BLAST: a new generation of protein database search programs" Nucleic Acids Res. 25:3389- 

10 3402; Bateman, A., Birney, E., Durbin, R., Eddy, S. R., Finn, R. D., and Sonnhammer, E. L. 
[ 1 999] "Pfam 3.1:1313 multiple alignments and profile HMMs match the majority of proteins" 
Nucleic Acids Res. 27:260-262). After assembling 3500 chromatograms into 6 contigs, Consed 
designed 43 finishing experiments. Custom oligonucleotide primers were synthesized by 
Integrated DNA Technologies (Coralville, IA), and upon completion of the experiments, the 

15 assembly contained the entire unique region of the genome and one inverted terminal repeat 
(ITR). After further data manipulation using the programs phrapview and miropeats, the ITR 
regions on either end were delineated (Horton & Nakai [1996] suprci\ Altschul, S. F., Gish, W., 
Miller, W., Myers, E. W., and Lipman, D. J. [1990] "Basic local alignment search tool"./. Mol. 
Biol 215:403-410). The consensus was sequenced to an average redundancy of 10X. The 

20 sequence was confirmed by comparision to BamHI, EcoRI, HindlR, Pstl, and Xho\ restriction 
maps of AmEPV (Hall, R. L. and Hink, W. F. [1990] "Physical mapping and field inversion gel 
electrophoresis of Amsacta moorei entomopoxvirus DNA" Arch. Virol. 110:77-90). The 
sequence of AmEPV has been deposited into GenBank, nucleotide acession number AF250284. 
Methionine-initiated open reading frames were delineated using Vector NTI. Open reading 

25 frames that translated into proteins less than 60 amino acids were discarded from our analysis. 
Relevant homologies were determined by BLAST analysis (Parsons, J. D. [1995] "Miropeats: 
graphical DNA sequence comparisons" Comput. Appl. Biosci. 1 1 :6 15-6 19; Ewing, B., Hillier, L., 
Wendl, M. C, and Green, P. [1 998] "Base-calling of automated sequencer traces using phred. I. 
Accuracy assessment" Genome Res. 8: 175-185), and additional domains found using the Pfam 

30 program (Gordon, D., Abajian, C., and Green, P. [1998] "Consed: a graphical tool for sequence 
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finishing" Genome Res. 8: 195-202). Default E (EXPECT) values of <0.01 were used to define 
homology to sequences in current databases. Transmembrane (TM), leucine zipper, and signal 
peptide (SP) domains were predicted by the Psort program (van Veen and Konings [1998] supra; 
Ewing, B. and Green, P. [1998] "Base-calling of automated sequencer traces using phred. II. 
5 Error probabilities" Genome Res. 8:186-194). 

Following are examples which illustrate procedures for practicing the invention. These 
examples should not be construed as limiting. All percentages are by weight and all solvent 
mixture proportions are by volume unless otherwise noted. 

10 Example 1 — Gene expression in cells infected with recombinant AmEPV 

293 cells (1 x 10 6 ) were placed in 6-well plate and infected with recombinant 
AmEPVpTKUFS or AmEPVpTKespgfp (Li et al [1997] supra) viruses at a multiplicity of five 
(5) virus particles/cell. As controls, cells were separately transfected with either the plasmid 
pTR-UF5 or pTKUF5 at a 5 //g/well plasmid DNA. Two days later, virus infected or plasmid 

15 transfected cells were transferred into 60 mm dishes, after 24 hr, neomycin resistant colonies 
were selected by adding G41 8 at the final concentration of 200 /zg/ml. G41 8 containing medium 
was changed every 3-4 days. 

For cells infected with recombinant AmEPV pTKespgfp, no neomycin resistant colony 
was observed, an expected result since this virus does not have NeoR gene. However, in cells 

20 infected with recombinant AmEPV pTKUF5 or transfected with plasmids pTR-UF5, G418 
resistant colonies were observed. All colonies from cells transfected with either of the two 
plasmids were both G418 resistant and GFP positive. However, colonies from cells infected 
with recombinant pTKUF5 were initially only G418 resistant, and not GFP positive. G418 
resistant colonies derived from the AmEPV recombinant also grew more slowly than those 

25 produced following plasmid transfection. Most likely, the explanation for these results is that 
GFP and NeoR gene copy number in AmEPV derived colonies is less than those transfected with 
plasmids. This explanation is likely to be true as we were able to show that the AmEPV derived 
colonies gradually become more and more resistant to G418 and soon, some GFP positive 
clusters of cells were observed which become more numerous and brighter. After several 

30 changes of medium, ultimately, all cells in the well were GFP positive. 
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Example 2 — Stable integration of foreign DNA sequences into mammalian cells infected with 
recombinant AmEPV 

Genomic DNA was recovered from cell lines created by either infection with the virus 
5 AmEPVpTKUFS or following transfection with a control plasmid pTR-UF5. Specifically, the 
recombinant AmEPVpTKUF5 was used to infect and subsequently select 293 (human kidney) 
cells at a multiplicity of 5 plaque forming units per cell, as described in Example 1. After 
growing the isolated cell lines reliably for multiple generations, DNA was isolated and digested 
with HindUl before electrophoresis and blotted with a random labeled probe containing the gfp 

1 0 and neo genes which are contained within the ITR regions of pTR-UF5 . As shown in Figure 2, 
lane P contains genomic DNA from 293 cells and pTR-UF5 plasmid, showing excision of the 
cassette from the plasmid upon digestion. A control (not shown) of 293 cells alone did not 
produce any endogenous cross-reacting bands. As seen in Figure 2, the host chromosomal site in 
the 293 genome of integration is random, as evidenced by the different sized bands resulting 

15 from HindUl digestion. In some cell lines, the event can be seen to have occurred more than 
once (multiple copies have integrated). Directional integration into the long arm of chromosome 
1 9 would be expected if the rep gene of AAV were simultaneously expressed. This experimental 
data proves delivery and stable integration of foreign DNA sequences by AmEPV. 

20 Example 3 — Growth and amplification of AmEPV 

AmEPV productively infects Lymantria d/sjrar-derived IPLB-LD-652 (LD) cells 
(Goodwin et al, 1978). LD cells were maintained at 28° C in a 1:1 ratio of TC-100 medium 
(Gibco, Gaithersburg, MD) to EX-CELL 401 medium (JRH Biosciences, Lenexa, KN) 
supplemented with 10% fetal bovine serum, 50 U/ml penicillin, and 50 /xg/ml streptomycin (1:1 

25 TE). To amplify virus, cells grown in 1 50 mm dishes are inoculated at a multiplicity of infection 
(m.o.i.) of 0.01 in sufficient media (5 ml) to cover the surface of the tissue culture vessel. The 
cells should be no more than 70% confluent. After adsorption at 28° C for 2 hr, 25 ml medium is 
added and the infections are incubated for 4-6 days at 28° C. The infection is considered 
complete when most cells become occlusion body positive as seen by light microscopy, i.e. when 

30 refractile occlusion bodies can be seen. For recombinant viruses in which lacZhas been inserted 
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into either the TK or the spheroidin locus, infection is monitored by in situ staining of infected 
cells with 1 mg/ml 5-Bromo-4-chloro-3-indolyl-]8-D-galactopyranoside (Xgal), 4 mM potassium 
ferricyanide, 4 mM potassium ferrocyanide, and 2 mM MgCl 2 in phosphate buffered saline 
(PBS) (140 mM NaCl, 2.7 mM KC1, 10 mM Na 2 P0 4 , 1.8 mM KH 2 P0 4 , pH 7.4). This stain 
5 allows /3-galactosidase producing infected cells to be visualized by the appearance of blue color. 
Though numbers can vary, an AmEPV infection at an m.o.i. of 0.01 generates a net yield of 
approximately 1 .5 plaque forming units (PFU) based on the total number of cells and results in a 
100-fold increase in PFU. Titers are routinely between 10 5 and 10 6 PFU/ml. Cells infected at a 
higher m.o.i. can generate higher yields/cell (-10) but the net increase in virus over input is 
10 lower. 

Example 4 — Partial purification and concentration of virus 

Some experiments necessitate higher concentrations of virus. The procedure that follows 
typically renders semi-pure virus stocks at titers > 10 8 PFU/ml. AmEPV infected cells are 
1 5 harvested by scraping and centrifuged at 700 x g for 1 5 min to remove cells. The supernatant is 
then subjected to ultracentrifugation at 70,000 x g for 2 hr to pellet the extracellular virus. The 
virus is resuspended in an appropriate amount of PBS (typically 1 00 /xl per 40 ml of supernatant) 
and the titer is determined by plaque assay. Total yield is typically 50% of input virus. 

20 Example 5 — Plaque Assay 

Virus to be titered is subjected to 10-fold serial dilutions in 1 :1 TE medium. LD-652 
cells are plated at 70% confluency in 6-well dishes, each having a 34.6 mm diameter (roughly 1 .4 
x 10 6 cells per well). Once the cells have adhered, the medium is removed and 0.5 ml of diluted 
virus is added to the wells. After adsorption at 28° C for 2 hr, the inoculum is removed and 2.5 

25 ml of overlay is added to each well. The overlay is a 2:1 ratio of 1.33X TC-100 medium 
(containing 14% fetal calf serum) and 4% sterile low melting point agarose, equilibrated to 42° C 
and mixed just prior to addition to the monolayer. 

For a spheroidin-positive virus, visible plaques appear and are counted one week post 
infection (Figure 3). Spheroidin-negative virus plaques are much more difficult to visualize, 

30 hence most spheroidin negative viruses have been engineered to contain lacZ. Such viruses can 
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be readily visualized by staining with Xgal as follows: a liquid overlay of 400 fig Xgal in 50 ^tl 
total solution is spread over the agarose in each well of the plaque assay 3-4 days post-infection. 
Plaques appear as blue patches of infected cells and are counted one week post-infection. 

Comet-like plaques of wtAmEPV, in the absence of an agarose overlay can be more 
5 rapidly visualized by immunostaining (Winter, J., R.L. Hall, and R.W. Moyer [1995] "The effect 
of inhibitors on the growth of the entomopoxvirus from Amsacta moorei in Lymantria dispar 
(gypsy moth) cells" Virology 211 :462-473). Plaque assays are prepared as above, except that a 
liquid overlay of medium replaces the agarose overlay. Infected cell monolayers are air dried 
three days postinfection and fixed in acid alcohol (95% EtOH, 5% glacial acetic acid) for 30 

1 0 minutes. After equilibration with TBS (0.02 M Tris pH 7.4,0. 1 5 M NaCl) the cells are incubated 
in TBS-Block (0.5% w/v blocking reagent in TBS [Boehringer Mannheim, Germany]) for 1 hr at 
room temperature to prevent nonspecific antibody binding. The primary antibody (rabbit anti- 
AmEPV occlusion body antiserum) (Hall et al. [1996] supra) or secondary antibody (goat anti- 
rabbit conjugated to alkaline phosphatase; Fisher, Atlanta, GA) are both diluted in TBS-Block. 

15 Antibody reactions and color development are performed as previously described (Harlow. E., 
and D. Lane [1998] Antibodies- A Laboratory Manual, E. Harlow and D. Lane, Eds., pp. 635- 
657. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY). 

Example 6 — Isolation of AmEPV genomic DNA 

20 LD-652 cells (typically 1 0 9 cells, thirty 150 mm dishes) are infected with AmEPV at an 

m.o.i. of 0.01. The infections (cells and medium) are harvested by scraping and centrifuged at 
700 x g for 1 5 min. to remove cells. The supernatant is centrifuged at 39,000 x g for 30 min. to 
pellet extracellular virus. The viral pellet is resuspended in deionized water (100 /xl for 40 ml of 
supernatant). DNAase free RNAase is added to the resuspended viral pellet at a final 

25 concentration of 50 /ig/ml and incubated at 37°C for 30 min. The virus sample is then heated to 
50° C, and an equal volume of lysis buffer (100 mM Tris pH 8.0, 10 mM EDTA, 54% sucrose, 
2% SDS, 10 mM /3-mercaptoethanol) is added to the sample. Proteinase K is then added to a 
final concentration of 0.6 mg/ml, and the viral lysate is incubated overnight at 50° C. The lysate 
is extracted three times with 50:49:1 phenol :chloroform:isoamyl alcohol and once with 
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chloroform, and the DNA is precipitated in 0.4 M LiCl, 95% ethanol. 
yields 2 (ig of genomic AmEPV DNA per 10 7 infected cells. 
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This procedure typically 



Example 7 — Shuttle vector plasmid construction 
5 Following the sequence determination of two non-essential genes, thymidine kinase (TK) 

and spheroidin (Gruidl et al [ 1 992] supra; Hall and Moyer [ 1 99 1 ] supra), we were able to create 
shuttle vector plasmids for the generation of recombinant AmEPV viruses. The shuttle vectors 
are described below. 

A. The TK insertion site shuttle vector : Oligonucleotide primers were used to PCR 
10 amplify a 748-bp fragment of downstream TK flanking sequence from plasmid pMEGTK-1 

(Gruidl et al [1992] supra). Another set of oligonucleotide primers was used to PCR amplify a 
663-bp fragment of TK upstream flanking sequence from pMEGTK-L These two fragments 
were separately inserted into pBluescript I SK(+) to produce pDUTK (Li, Y., Hall, R. L., and 
Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate cells by 
1 5 recombinant entomopoxviruses" J. Virol 7 1 :9557-9562). Foreign genes were then cloned within 
the TK flanks to generate shuttle vectors for the generation of recombinants (Li, Y., Hall, R. L., 
and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate cells by 
recombinant entomopoxviruses" J. Virol 71:9557-9562). 

B. The spheroidin insertion site shuttle vector : Oligonucleotides were used to PCR 
20 amplify a 1046-bp fragment of upstream spheroidin flanking sequence from plasmid pRH512 

(Hall and Moyer [1991] supra). In addition to 1023-bp of sequence upstream of the spheroidin 
gene, this fragment contained the starting ATG of the spheroidin coding sequence and twenty 
base pairs following the ATG. Another set of oligonucleotide primers was used to amplify a 
998-bp fragment of downstream spheroidin flanking sequence from pRH512. These two 
25 fragments were separately inserted into pBluescript I SK(+) to produce pDU20 (Hall, R. L., Li, 
Y., Feller, J. A., and Moyer, R. W. [1996] "The Amsacta moorei entomopoxvirus spheroidin 
gene is improperly transcribed in vertebrate poxviruses" Virology 224:427-436). Subsequent 
constructs were cloned within the spheroidin flanks to generate various shuttle vectors for the 
generation of recombinants (Hall et al [1996] Virology 224:427-436). 
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C. AmEPV early promoter constructs : Promoters for early poxvirus genes are active 
prior to viral DNA replication. We have utilized two early EPV promoters in our constructs. 
The first, an AmEPV early strong promoter (esp) was derived from a strongly expressed 42 kDa 
early protein (Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal expression of 

5 genes in vertebrate cells by recombinant entomopoxviruses" J. Virol. 7 1 :9557-9562). The second 
promoter was derived from the early expressed fusolin (fus) gene as described (Gauthier et al. 
[1995] supra). These promoters have been used to regulate reporter genes (lacZ, gfp). The 
appropriately regulated genes are then placed within shuttle vectors and transfected into infected 
cells to produce recombinant viruses. The shuttle vector pTK-fus/acZ was constructed by PCR 

10 amplification of the MmEPV fusolin early promoter from pHF51 and insertion into pDUTK; 
lacZ was subcloned from pMC 1 87 1 (Pharmacia Biotech, Inc., Piscataway, N.J.) as described (Li, 
Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate 
cells by recombinant entomopoxviruses" J. Virol 71:9557-9562). The shuttle vector pTK- 
QsplacZ (pTK-42k/a cZ) was constructed by cloning the PCR amplified esp promoter into pTK- 

15 faslacZ after excision of the fusolin promoter (Li, Y., Hall, R. L., and Moyer, R. W. [1997] 
"Transient, nonlethal expression of genes in vertebrate cells by recombinant entomopoxviruses" 
J. Virol. 71:9557-9562). To construct pTK-espg^?, a green fluorescent protein gene (gfp) was 
PCR-amplified from the pTR-UF5 plasmid (Vector Core, University of Florida) (18) and cloned 
into pTK-esp/acZ replacing the esplacZ cassette as described in (Li, Y., Hall, R. L., and Moyer, 

20 R. W. [1997] "Transient, nonlethal expression of genes in vertebrate cells by recombinant 
entomopoxviruses" J. Virol. 71:9557-9562). 

D. AmEPV late promoter constructs : We have used the spheroidin (sph) promoter as an 
example of an AmEPV strong late promoter. This promoter has two rather unexpected 
properties: (1) the sph promoter appears to be insect cell specific and functions very poorly in 

25 vertebrate cells (Hall et al. [1996] supra), and (2) unlike previously described late poxvirus 
promoters, we have found in insect cells that expression is further enhanced by including the 20 
bp downstream of the TAAATG promoter in the reporter gene constructs (23). pDU20/acZ was 
created by insertion of lacZ (from plasmid pMCl 871 , Pharmacia Biotech, Inc., Piscataway, NJ) 
into the BamHl site of pDU20. The final reporter contains 1046 bp of potential spheroidin 

30 promoter sequence plus 20 bp of additional downstream spheroidin coding sequence following 

J:\UF\221C1 XCZ 1 VAPPLIC ATIONVAPP. DOC/DNB/la 



70 UF-221C1XCZ1 

the TAAATG sequence before fusion to lacZ (Hall et al. [1996] supra). pDXJHacZ was 
constructed using the same strategy as that for pDU20/a cZ, except that only 2 bp of spheroidin 
coding sequence follows the translation-starting TAAATG before fusion to lacZ (Hall et al 
[1996] supra, Li, Y., R.L. Hall, S. Yuan, and R.W. Moyer [1998] "High-level expression of 
5 Amsacta moorei entomopoxvirus spheroidin depends on sequences within the gene" J. Gen. 
Virol. 79:613-622). We have also constructed and used the cowpoxvirus late ATI gene promoter 
to drive lacZ which functions well in both insect and vertebrate cells (Li, Y., R.L. Hall, S. Yuan, 
and R.W. Moyer [1998] "High-level expression of Amsacta moorei entomopoxvirus spheroidin 
depends on sequences within the gene" J. Gen. Virol 79:613-622). 

10 E. Construct driven by Pol II specific promoters : An AmEPV construct containing 

reporter genes driven by Pol II rather than poxvirus promoters has also been prepared based on 
the plasmid pTR-UF5 (Vector Core, University of Florida) (Klein, R. L., E. M. Meyer, A. L. 
Peel, S. Zolotukhin, C. Meyers, N. Muzyczka, and M.A. King [1998] "Neuron-specific 
transduction in the rat septohippocampal or nigrostriatal pathway by recombinant adeno- 

1 5 associated virus vectors" Experimental Neurology 150:183-1 94) which is a plasmid containing 
two mammalian reporter genes: (1) a gene coding for neomycin resistance (neoR) driven by the 
herpes simplex virus TK promoter and (2) gfp driven by the immediate early promoter/enhancer 
from cytomegalovirus (CMV). The dual gene cassette is flanked by the AAV-inverted terminal 
repeats (ITRs). pTR-UF5 was digested with Sail to remove two Pstl sites then religated to form 

20 pTRUF5)&7/I. This construct was then digested with Pstl, and the fragment containing the two 
reporter genes was inserted into the Pstl site of pDUTK to produce pTKUF5)Sa/7. This construct 
was then digested with Sail and the previously removed Sail fragment was reinserted into the 
construct to produce pTKUFS (Figure 1). 

25 Example 8 — Generation and Selection of recombinant AmEPV 

Neither the spheroidin gene nor the thymidine kinase gene is required for propagation of 
AmEPV in cell culture (Palmer, CP., D.P. Miller, S.A. Marlow, L.E. Wilson, A.M. Lawrie, and 
L.A. King [1995] "Genetic modification of an entomopoxvirus: deletion of the spheroidin gene 
does not affect virus replication in vitro" J. Gen. Virol. 76:15-23; R.W. Moyer, Li, Y. and 

30 Bawden, A., unpublished results), providing sites for insertion of foreign genes by homologous 
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recombination. Following transfection of AmEPV infected LD-652 cells, all foreign genes are 
inserted into either the TK or the spheroidin locus (Hall et ah [1996] supra). 

A. Transfection of infected cells : LD-652 cells (1.4 x 10 6 cells, 70% confluent in a 34.6 
mm dish) are infected with AmEPV at an m.o.i. of 5 PFU per cell in a volume of 1 ml. Two 

5 hours post-infection, the inoculum is aspirated and 1 ml of transfection mix + DNA is added. 

Transfection mix + DNA is prepared by separately combining 20 fx\ Lipofectin (Gibco, 
Gaithersburg, MD) and 80 /xl 1:1 TE media without FBS, and 5 jig of shuttle vector plasmid 
DNA in a volume of 100 jx\ of the same media. The mixture is incubated at room temperature 
for 1 5 min. The concentrated transfection mix is then diluted by addition of 800 /xl of 1 : 1 TE 
1 0 without FBS and added to cells. After 6 hr, the transfection mix is removed and replaced with 2 
ml of 1:1 TE with 10% FBS. Three days post-infection the supernatant is harvested by 
centrifugation at 200 x g. 

B. Selection of AmEPV recombinants : For selection of recombinants inserted into the 
TK gene, a TK(-) cell line, CI 1.3, was derived by serial passage of LD-652 cells in increasing 

1 5 concentrations of 5-bromo-2'-deoxyuridine (BudR) (10 jtig/ml increasing increments of BudR at 
intervals of five weeks over one year). CI 1.3 cells are maintained at in 1:1 TE medium 
supplemented with 100 j^g/ml BudR (Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, 
nonlethal expression of genes in vertebrate cells by recombinant entomopoxviruses" J. Viroh 
71:9557-9562). Supernatants harvested from the infection/transfection described above are 

20 plaqued on C 1 1 .3 cells in the presence of 1 00 /xg/ml BudR. Only virus lacking a functional TK 
gene, i.e. recombinant virus, will grow in the presence of BudR. For selection of recombinants 
where insertion is within the spheroidin gene (where lacZ has been incorporated into the shuttle 
plasmid) recombinant virus plaques are identified following staining with Xgal as described 
above or by the selection of non-refractile plaques. 

25 C. Purification of recombinants : Dilutions at 1 0" 2 to 1 0^ of the infected cell supernatant 

are assayed by repeated plaquing (described above). Individual plaques are isolated and diluted 
in 1 ml 1:1 TE medium plus 10% FBS. Isolates are replaqued at dilutions at 10' 1 to 10" 3 . The 
plaque purification is repeated 3-4 times prior to plaque expansion for the propagation of larger 
stocks. These viruses can be stored at 4° C for 2-3 months or at -80° C for long-term storage. 

30 
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Example 9 — Foreign gene expression in permissive insect cells 

In our hands, we find the level of foreign gene expression within infected insect cells 
when driven by either of two late promoters (the cowpox virus ATI or spheroidin +20 promoters) 
equals or exceeds that of vaccinia or baculoviruses on a per/cell basis. The technology needs no 
5 elaboration, as once suitable recombinant viruses are constructed, only appropriate infection of 
insect cells is necessary. 

Example 10 — Transient expression in vertebrate cells 

The normal host range of AmEPV is limited to Lepidoptera (butterflies), and early 

1 0 experiments attempting to infect vertebrate cells with AmEPV indicated no obvious deleterious 
effects on the cells. Given the general promiscuity of poxviruses in the binding and entering of 
cells and the similarity of the AmEPV life cycle to that of vaccinia, we had reason to believe 
AmEPV would infect and enter vertebrate as well as insect cells. AmEPV recombinants were 
constructed carrying the lacZ reporter gene regulated by either of two early AmEPV promoters, 

15 the late spheroidin promoter or the ATI promoter from cowpox (TK-fus/acZ, TK-esp/acZ, 
SPH(20)/acZ and TK-ATI/acZ, described above). When mammalian CV-1 cells were infected 
with the recombinant viruses at an m.o.i. of 10 PFU/cell, those cells infected with AmEPV 
recombinants where lacZ was regulated by either the MmEPV fusolin or the AmEPV 42kD 
protein early promoters, expressed /3-galactosidase (Figure 3, panels D and E) (Li, Y., Hall, R. L., 

20 and Moyer, R. W. [1997] "Transient, nonlethal expression of genes in vertebrate cells by 
recombinant entomopoxviruses" J. Virol 71:9557-9562). No expression was observed for the 
recombinants where lacZ was under control of either of the two late promoters (Figure 3, panels 
B and C). Thus, AmEPV enters vertebrate cells and undergoes at least a partial uncoating which 
allows early, pre-replicative viral genes to be expressed. The infection is blocked thereafter and 

25 neither viral DNA replication nor late protein synthesis is observed. Early expression results 
from the fact that AmEPV, like other poxviruses, packages the enzymes necessary for early gene 
transcription within the virion. However, if vertebrate cells are co-infected with both VV and 
AmEPV, late promoters within AmEPV are rescued and activated suggesting that vaccinia can 
provide factors in trans which are needed for the infection to progress and activate the late 

30 promoters. 

J:\UF\22 1 C 1 XCZ 1 \APPLIC ATION\APP.DOC/DNB/la 



73 UF-221C1XCZ1 

While determining the basis of host range restriction is difficult, the cytoplasmic nature 
of AmEPV coupled with a virus encoded transcription and replication machinery offers major 
advantages for vector design. We can be fairly certain that late genes are not transcribed because 
of the lack of /3-galactosidase expression from late promoters in vertebrate cells and because 
5 DNA synthesis, a requirement for late mRNA synthesis does not occur. It is quite possible that 
incomplete uncoating of the virus leads to the block in gene expression. 

Normally, uncoating of poxviruses occurs in two discrete steps. Upon entry into cells, 
virions are sufficiently permeabilized to allow early gene transcription from the viral core. Early 
proteins allow the complete uncoating of the core to allow transcription of the later classes of 

1 0 genes following interaction of newly synthesized DNA with intermediate and late transcription 
factors. The uncoating of vertebrate poxviruses has been thoroughly studied, and uncoating 
intermediates have been identified through differential centrifiigation of cellular extracts infected 
with labeled virus. A viral activity specifically required for the second stage of uncoating has 
been identified. By analogy with vaccinia, AmEPV might be expected to encode a similar 

15 uncoating factor. If so, then one would not necessarily expect a cell-line specific block in 
uncoating unless this uncoating protein acts in conjunction with cellular components. Should we 
find that the particle is uncoated, then AmEPV fails to express other genes in vertebrate cells 
required for the infection to continue. Host range restriction of another insect virus family, the 
Baculoviridae, has received considerable attention. For baculoviruses, blockage may occur at 

20 many stages during the activation of late or very late genes after viral DNA enters the nucleus 
(Carbonell, L.F., M.J. Klowden, and L.K. Miller [1985] "Baculovirus-mediated expression of 
bacterial genes in dipteran and mammalian cells" J. Virol. 56:153-160; Carbonell, L.F.andL.K. 
Miller [1987] "Baculovirus interaction with nontarget organisms: a virus-borne reporter gene is 
not expressed in two mammalian cell lines" Appl. Environ. Microbiol. 53:1412-1417; Morris, 

25 T.D. and L.K. Miller [1992] "Promoter influence on baculovirus-mediated gene expression in 
permissive and nonpermissive insect cell lines" J. Virol. 66:7397-7405; Morris, T.D. arid L.K. 
Miller [ 1 993] "Characterization of productive and non-productive AcMNPV infection in selected 
insect cell lines" Virology 197:339-348). The questions related to host range specificity are 
outside the scope of this review, but the limit of expression to those genes under the control of 
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early promoters following infection of vertebrate cells is a key property of the virus which 
ultimately makes it a potential candidate as a vector for transient expression in vertebrate cells. 

The most novel feature of this system is the survival and continued growth of the infected 
vertebrate cells. We immediately noticed that there were no observable cytopathic effects in the 
5 infected CV-1 cells. Survival and growth of the infected cells was shown following infection 
with recombinant AmEPV TK-espg^?. Initially, individual fluorescent cells resulted, which over 
a period of two to three days, divided to form fluorescent microclusters of cells (Figure 4). There 
is no other known precedent of cells surviving a poxvirus infection. The fact that mammalian 
cells survive suggests that AmEPV offers the potential for a highly efficient, nontoxic method of 

10 foreign gene delivery into vertebrate cells for transient expression of foreign genes, after which 
the cells continue to grow unabated. While initial observations were made using CV-1 cells, 
these results have been extended to many other cell lines. In general, lymphocytic cells are more 
resistant to infection (Li, Y., Hall, R. L., and Moyer, R. W. [1997] "Transient, nonlethal 
expression of genes in vertebrate cells by recombinant entomopoxviruses" Virol 71:9557- 

15 9562). 

Example 1 1— AmEPV mediated gene expression in the mouse 

To examine the feasibility of AmEPV to deliver and express foreign genes in vivo, we 
examined the effects of injection of TK-esp/acZ and SPH(20)/acZ into mouse muscle. 

20 Approximately 2 x 1 06 PFU ( 1 00 /il) of each virus was inj ected into the muscle of the hind leg of 
two separate mice; as an additional control, we also injected 1 00 fi\ PBS into a third mouse. Two 
days post infection, the mice were sacrificed and the muscle was excised into small pieces, fixed 
in a 3% formaldehyde solution for 30 min, and stained with X-gal. Extensive /3-galactosidase 
expression occurred in the muscle from the TK-esp/acZ infected mouse (Figure 5). No 

25 expression was seen in either the SPH(20)/acZ or the PBS control. Thus, consistent with our 
observations of infected mammalian cells in culture, AmEPV can also enter cells in vivo and 
allow early, but not late expression of a reporter gene. 

30 
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Example 12 — The control of AmEPV induced inflammation 

One concern with complex viral vectors is the potential for the unintended induction of 
inflammatory and immunological responses following administration. In studies with adenovirus, 
inflammation and immunogenicity to the virus and to virus-infected cells has limited transgene 
5 expression and the utility of this approach to treat chronic illnesses. Inflammation is initially 
characterized by perivascular and peribronchiolar inflammatory cell infiltration. Neutrophils and 
later macrophages and lymphocytes frequent the site of the infected area. Specific cytokines can 
also be measured as an index of the inflammatory response (Ginsberg, H.S., L.L. Moldawer, P.B. 
Sehgal, M. Redington, P.L. Kilian, R.M. Chanock, and G.A. Prince [1991] "A mouse model for 

10 investigating the molecular pathogenesis of adenovirus pneumonia" Proc. Natl. Acad. Sci. 
U.S.A. 88:1651-1655; Noah, T.L., LA. Wortman, P.C. Hu, M.W. Leigh, and R.C. Boucher 
[1996] "Cytokine production by cultured human bronchial epithelial cells infected with a 
replication-deficient adenoviral gene transfer vector or wild-type adenovirus type 5" Am. J. 
Respir. Cell Mol Biol. 14:417-424). The early response to adenovirus infection consists of 

1 5 diffuse cellular infiltration of peribronchiolar and alveolar regions associated with the appearance 
of several classes of pro-inflammatory cytokines (Ginsberg et al [1991] supra; Noah et al 
[1996] supra). These include TNF-a, IL-1, IL-6, and IL-8 (KC/GRO in the mouse). There is 
considerable experimental evidence from rodents demonstrating that these classes of cytokines, 
and in particular TNF-a and IL-8 (or KC/GRO), play central roles in the recruitment and 

20 activation of inflammatory cell populations in the lung. 

It is likely with a virus as complex as AmEPV that unintended inflammation will result 
when the virus is introduced in vivo. However, vertebrate poxviruses may serve as a source of 
genes to provide a solution to this problem. There have been a variety of vertebrate poxvirus- 
encoded secreted virokines and viroceptors described including IFN-'VjS, IFN-(, TNF and IL-1, 

25 and chemokine receptors (Barry, M. and G. McFadden [1997] "Virus encoded cytokines and 
cytokine receptors" Parasitology 115:S89-100; Smith, G.L., J.A. Symons, A. Khanna, A. 
Vanderplasschen, and Alcami, A. [1997] "Vaccinia virus immune evasion" Immunol Rev. 
159:137-154:137-154; Turner, P.C. and R.W. Moyer [1998] "Control of apoptosis by 
poxviruses" Seminars in Virology 8:453-469). Should inflammation be observed, cloning any or 

30 all of these genes into AmEPV to control any AmEPV induced inflammation is quite plausible. 
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Example 13 — The use of AmEPV to stably transform mammalian cells 

A. Considerations in the design of AmEPV vectors for stable transformation : Poxviruses 
are cytoplasmic and therefore poxvirus promoters are recognized only by the poxvirus encoded 

5 RNA polymerase and not by RNA polymerase II of the host cell. Since we wished to 
demonstrate the ability of AmEPV to mediate stable transformation of mammalian cells, we 
constructed a recombinant AmEPV containing a cassette in which gfp and a gene conferring 
neomycin resistance were cloned under the control of promoters recognized by the cellular (not 
poxvirus) RNA polymerase (Figure 6). The promoters chosen were the CMV immediate early 

10 and herpesvirus TK gene promoters respectively, and the cassette was flanked by the inverted 
terminal repeat (ITR) sequences of AAV. 

Although the exact mechanism of AAV site specific integration is unknown, two required 
viral components have been identified. These are the inverted terminal repeat sequences of AAV 
DNA (ITRs) and the Rep 78/68 proteins. The ITRs comprise two 145-nucleotide elements 

15 located at either end of the AAV genome. ITR sequences enclosing marker genes have been 
shown to allow a lower level of random genome integration when compared to the levels of 
specific integration observed when genes encoding the Rep 78/68 proteins are also included in 
constructs. 

B. Selection of stable recombinants : When vertebrate cells are transfected with plasmid 
20 pTK-UF5, and selected with G41 8 (250mg/ml) 24 hours later, colonies of resistant, fluorescent 

vertebrate cells are observed after about ten days of selection. In contrast, colonies from cells 
infected with AmEPVpTK-UF5 at an m.o.i. of five were G418 resistant, but not initially GFP 
positive. Approximately 20 G418-resistant clones can be isolated following infection of one 
million cells (one 35 mm dish). The efficiency of transformation is only 5-10 fold less than that 

25 for optimized plasmid mediated transfections. These AmEPV-derived colonies were observed to 
initially grow more slowly in the presence of G-418 than those produced from a plasmid 
transfection. Most likely, the explanation for these results is that the GFP and NeoR gene copy 
number in AmEPV-derived colonies is less than that for clones derived from cells transfected 
with the pTKUF5 plasmid. This explanation is further supported by the observation that the 

30 AmEPV derived colonies gradually became more resistant to G418 and that GFP expression 
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while delayed, is ultimately observed. Fluorescent cells became both more numerous and 
brighter, consistent with a gradual increase in GFP concentration with time. Ultimately all cells 
in each clonal isolate were GFP positive, as shown in Figure 6. 

We are currently characterizing these cell lines, which have been grown reliably for 
multiple generations. Our unpublished data suggests random integration of the marker cassette 
into the cellular genome. Inclusion of the AAV rep gene in future constructs is anticipated to 
provide directed insertion into chromosome 19 (Samulski, R.J., X. Zhu, X. Xiao, J.D. Brook, 
D.E. Housman, N. Epstein and L.A. Hunter [1991] "Targeted integration of adeno-associated 
virus (AAV) into human chromosome 19" EMBOJ. 10:3941-3950). 

It should be understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of this 
application and the scope of the appended claims. 
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